What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?椅ゆ????嚴щ?苑ч?膺??孃??? 1110010011101000100000101110101000111111100010001101011010000010111001000011111100111111001111110011111110011010100011101000010010001011001111111000100110010001100001001000100100111111111001000101111000111111001111111001101101101111001111110011111100111111 e4e882ea3f88d682e43f3f3f3f9a8e848b3f899184893fe45e3f3f9b6f3f3f3f
EUC-JP 蒻れ?椅ゆ????嚴щ?苑ч?膺??孃??? 1110100011101010101001001110110000111111101100001101100010100100111001100011111100111111001111110011111111010011111011101010011111101011001111111011000111110001101001111110100100111111111001111011111100111111001111111101010111010000001111110011111100111111 e8eaa4ec3fb0d8a4e63f3f3f3fd3eea7eb3fb1f1a7e93fe7bf3f3fd5d03f3f3f
UTF-8 蒻れ슜椅ゆ끽栒띠쒀嚴щ씞苑ч댖膺덇탽孃뉗쉮劉 11101000100100101011101111100011100000101000110011101100100010101001110011100110101001001000010111100011100000101000011011101011100000011011110111100110101000001001001011101011100111011010000011101100100100101000000011100101100110101011010011010001100010011110110010010100100111101110100010001011100100011101000110000111111010111000110010010110111010001000011010111010111010111000110110000111111011011000001110111101111001011010110110000011111010111000100110010111111011001000100110101110111011111010011110000111 e892bbe3828cec8a9ce6a485e38286eb81bde6a092eb9da0ec9280e59ab4d189ec949ee88b91d187eb8c96e886baeb8d87ed83bde5ad83eb8997ec89aeefa787
UHC 蒻れ슜椅ゆ끽栒띠쒀嚴щ씞苑ч댖膺덇탽孃뉗쉮劉 1110010110110110101010101110110010011010101010011110101111110101101010101110011010110011101000111110001011100011101101101110110010111110101011001110010111110001101011001110101110011101101100101110101010111101101011001110100110001000101110101110101111101100100010001110101010110101100110011110010110111110100001111110110010011010100001101110101011100101 e5b6aaec9aa9ebf5aae6b3a3e2e3b6ecbeace5f1aceb9db2eabdace988baebec88eab599e5be87ec9a86eae5

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
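
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) are assumed mappings onto Python's standard codecs, and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("れ").items():
        print(label, bits, hexa)
```

For example, `encode_row("れ", "utf-8")` yields the hex string `e3828c`, which matches the second character in the UTF-8 row of the table.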