Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 狎??倚?????渦??狎??倚?????畏 11100000101111100011111100111111100110001101111100111111001111110011111100111111001111111000100101010001001111110011111111100000101111100011111100111111100110001101111100111111001111110011111100111111001111111000100011011000 e0be3f3f98df3f3f3f3f3f89513f3fe0be3f3f98df3f3f3f3f3f88d8
EUC-JP 狎??倚??洧??渦??狎??倚??洧??畏 1110000011000000001111110011111111010000111000010011111100111111100011111100011110110100001111110011111110110001101100100011111100111111111000001100000000111111001111111101000011100001001111110011111110001111110001111011010000111111001111111011000011011010 e0c03f3fd0e13f3f8fc7b43f3fb1b23f3fe0c03f3fd0e13f3f8fc7b43f3fb0da
UTF-8 狎녿씒倚롧뛾洧곗뎾渦깆쿈狎녿씒倚롧뛾洧곗뎾畏 111001111000101110001110111010111000010110111111111011001001010010010010111001011000000010011010111010111010000110100111111010111001101110111110111001101011010010100111111010101011001110010111111010111000111010111110111001101011100010100110111010101011100110000110111011001011111110001000111001111000101110001110111010111000010110111111111011001001010010010010111001011000000010011010111010111010000110100111111010111001101110111110111001101011010010100111111010101011001110010111111010111000111010111110111001111001010110001111 e78b8eeb85bfec9492e5809aeba1a7eb9bbee6b4a7eab397eb8ebee6b8a6eab986ecbf88e78b8eeb85bfec9492e5809aeba1a7eb9bbee6b4a7eab397eb8ebee7958f
UHC 狎녿씒倚롧뛾洧곗뎾渦깆쿈狎녿씒倚롧뛾洧곗뎾畏 1110010011100100100001101110101110011101101010001110101111101111100011101110011110001101100001001110101011111011101100001110110010001001100100011110100010111110101100011110110010110010100111011110010011100100100001101110101110011101101010001110101111101111100011101110011110001101100001001110101011111011101100001110110010001001100100011110100011100110 e4e486eb9da8ebef8ee78d84eafbb0ec8991e8beb1ecb29de4e486eb9da8ebef8ee78d84eafbb0ec8991e8e6

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
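
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. It assumes the table labels map to Python's standard codecs as listed in `CHARSETS` (SJIS-WIN to `cp932` follows the note above; UHC to `cp949` is an assumption), and that characters a charset cannot represent are replaced with `?` (byte 3f), which is consistent with the runs of 3f in the table. The function name `encode_table` is hypothetical.

```python
# Map the table's charset labels to Python codec names.
# These mappings are assumptions for illustration.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("あ"):
        print(label, bits, hexa)
```

Each row holds the same three pieces of information as the bit-string columns above: the charset label, the bytes as a binary string, and the bytes in hexadecimal.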