To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??乙??幽??冶⑤?怡ワ??μ?馭 11100010101000110011111100111111100010011011001100111111001111111001011101001000001111110011111110010110111010001000011101000100001111111001110001111101100000111000111100111111001111111000001111001010001111111110100101100110 e2a33f3f89b33f3f97483f3f96e887443f9c7d838f3f3f83ca3fe966
EUC-JP 筌??乙??幽??冶??怡ワ??μ?馭 111001001010010100111111001111111011001010110101001111110011111111001101101010010011111100111111110011001110101000111111001111111101011111011110101001011110111100111111001111111010011011001100001111111111000111000111 e4a53f3fb2b53f3fcda93f3fccea3f3fd7dea5ef3f3fa6cc3ff1c7
UTF-8 筌㏂끋乙뤷ㅇ幽됯킐冶⑤씮怡ワ㎖類μ깂馭 1110011110101101100011001110001110001111100000101110101110000001100010111110010010111001100110011110101110100100101101111110001110000101100001111110010110111001101111011110101110010000101011111110110110000010100100001110010110000110101101101110001010010001101001001110110010010100101011101110011010000000101000011110001110000011101011111110001110001110100101101110111110100111100100001100111010111100111010101011100110000010111010011010011010101101 e7ad8ce38f82eb818be4b999eba4b7e38587e5b9bdeb90afed8290e586b6e291a4ec94aee680a1e383afe38e96efa790cebceab982e9a6ad
UHC 筌㏂끋乙뤷ㅇ幽됯킐冶⑤씮怡ワ㎖類μ깂馭 1110111110100111101000101110001110000101101111011110101111100000100011111110010110100100101101111110101011101011100010011110101010110100100111001110010110100111101010001110101110011101101111111110110010101110101010111110111110100111101000101110101110111010101001011110110010000011100001001110010111011111 efa7a2e385bdebe08fe5a4b7eaeb89eab49ce5a7a8eb9dbfecaeabefa7a2ebbaa5ec8384e5df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)