What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????B???v??????B???vB 0011111100111111001111110011111100111111001111110100001000111111001111110011111101110110001111110011111100111111001111110011111100111111010000100011111100111111001111110111011001000010 3f3f3f3f3f3f423f3f3f763f3f3f3f3f3f423f3f3f7642
SJIS-WIN 蜈??節??B瑤??v蜈??節??B瑤??vB 1110010110000101001111110011111110010000110111110011111100111111010000101110101010100010001111110011111101110110111001011000010100111111001111111001000011011111001111110011111101000010111010101010001000111111001111110111011001000010 e5853f3f90df3f3f42eaa23f3f76e5853f3f90df3f3f42eaa23f3f7642
EUC-JP 蜈??節??B瑤??v蜈??節??B瑤??vB 1110100111100101001111110011111111000000111000010011111100111111010000101111010010100100001111110011111101110110111010011110010100111111001111111100000011100001001111110011111101000010111101001010010000111111001111110111011001000010 e9e53f3fc0e13f3f42f4a43f3f76e9e53f3fc0e13f3f42f4a43f3f7642
UTF-8 蜈곫눟節깍숯B瑤뗰쉑v蜈곫눟節깍숯B瑤뗰쉑vB 1110100010011100100010001110101010110011101010111110101110001000100111111110011110101111100000001110101010111001100011011110110010001000101011110100001011100111100100011010010011101011100101111011000011101100100010011001000101110110111010001001110010001000111010101011001110101011111010111000100010011111111001111010111110000000111010101011100110001101111011001000100010101111010000101110011110010001101001001110101110010111101100001110110010001001100100010111011001000010 e89c88eab3abeb889fe7af80eab98dec88af42e791a4eb97b0ec899176e89c88eab3abeb889fe7af80eab98dec88af42e791a4eb97b0ec89917642
UHC 蜈곫눟節깍숯B瑤뗰쉑v蜈곫눟節깍숯B瑤뗰쉑vB 1110100010100101100000011110011010000111101101111110111110111101101100011110111110111101101000010100001011101000111111011000101111101111101111011010011101110110111010001010010110000001111001101000011110110111111011111011110110110001111011111011110110100001010000101110100011111101100010111110111110111101101001110111011001000010 e8a581e687b7efbdb1efbda142e8fd8befbda776e8a581e687b7efbdb1efbda142e8fd8befbda77642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
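
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation, which is not shown on this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `to_bit_strings` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` bytes in the table, though the tool's exact substitution rules may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes '?' for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


def encode_table(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("B").items():
        print(label, binary, hexadecimal)
```

Because "B" is plain ASCII, every charset above encodes it as the single byte 0x42; non-ASCII input is where the rows start to differ.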