To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 雕旃諌∬ョ娯煬荵呵ク旃諌∬ョ娯煬荵凩B 11101000101110001001110111010001100010101101000010000001111010001010111010001100111000101110000010001100111001001011100110011001111010001011100010011101110100011000101011010000100000011110100010101110100011001110001011100000100011001110010010111001100110010111110101000010 e8b89dd18ad081e8ae8ce2e08ce4b999e8b89dd18ad081e8ae8ce2e08ce4b9997d42
EUC-JP 雕旃諌∬ョ娯煬荵呵ク旃諌∬ョ娯煬荵凩B 11110000101110101101101011010011101101001101001010100010111010101000111010101110101110001110010011011111111011001110100010111011110100101110101010001110101110001101101011010011101101001101001010100010111010101000111010101110101110001110010011011111111011001110100010111011110100011101111001000010 f0badad3b4d2a2ea8eaeb8e4dfece8bbd2ea8eb8dad3b4d2a2ea8eaeb8e4dfece8bbd1de42
UTF-8 雕旃諌∬ョ娯煬荵呵ク旃諌∬ョ娯煬荵凩B 11101001100110111001010111100110100101111000001111101000101010111000110011100010100010001010110011101111101111011010111011100101101010001010111111100111100001011010110011101000100011011011010111100101100100011011010111101111101111011011100011100110100101111000001111101000101010111000110011100010100010001010110011101111101111011010111011100101101010001010111111100111100001011010110011101000100011011011010111100101100001111010100101000010 e99b95e69783e8ab8ce288acefbdaee5a8afe785ace88db5e591b5efbdb8e69783e8ab8ce288acefbdaee5a8afe785ace88db5e587a942
UHC 雕??∬??煬?呵???∬??煬??B 11110000111001110011111100111111101000011111001100111111001111111110010111001001001111111100101010100111001111110011111100111111101000011111001100111111001111111110010111001001001111110011111101000010 f0e73f3fa1f33f3fe5c93fcaa73f3f3fa1f33f3fe5c93f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)