To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??泣??遺??鼇??泣ユ?魏??鴉??泣 10010110111010010011111100111111100010111000001100111111001111111000100011100010001111110011111111101010100001110011111100111111100010111000001110000011100001100011111111101001101100000011111100111111111010011110101100111111001111111000101110000011 96e93f3f8b833f3f88e23f3fea873f3f8b8383863fe9b03f3fe9eb3f3f8b83
EUC-JP 夜??泣??遺??鼇??泣ユ?魏??鴉??泣 11001100111010110011111100111111101101011110001100111111001111111011000011100100001111110011111111110011111001110011111100111111101101011110001110100101111001100011111111110010101100100011111100111111111100101110110100111111001111111011010111100011 cceb3f3fb5e33f3fb0e43f3ff3e73f3fb5e3a5e63ff2b23f3ff2ed3f3fb5e3
UTF-8 夜쏄낀泣됬럦遺띤뜑鼇잙돁泣ユ슅魏됱뒟鴉띯뫅泣 111001011010010010011100111011001000111110000100111010111000001010000000111001101011001110100011111010111001000010101100111010111001111110100110111010011000000110111010111010111001110110100100111010111001110010010001111010011011110010000111111011001001111010011001111010111000111110000001111001101011001110100011111000111000001110100110111011001000101010000101111010011010110110001111111010111001000010110001111010111001001010011111111010011011010010001001111010111001110110101111111010111010101110000101111001101011001110100011 e5a49cec8f84eb8280e6b3a3eb90aceb9fa6e981baeb9da4eb9c91e9bc87ec9e99eb8f81e6b3a3e383a6ec8a85e9ad8feb90b1eb929fe9b489eb9dafebab85e6b3a3
UHC 夜쏄낀泣됬럦遺띤뜑鼇잙돁泣ユ슅魏됱뒟鴉띯뫅泣 1110010110101000100110111110101010110011101001001110101111101000100010011110011110001110100010011110101110110110101101101110110110001101100101001110100010101000100111111110101110001001100101001110101111101000101010111110011010011010100101111110101011100000100010011110110010001010100110111110010010111100100011011110001010010001101010001110101111101000 e5a89beab3a4ebe889e78e89ebb6b6ed8d94e8a89feb8994ebe8abe69a97eae089ec8a9be4bc8de291a8ebe8
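The rows above can be approximated offline with a short Python sketch. It rests on assumptions rather than on this page's actual implementation: the Python codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed equivalents of the page's charset labels, and `CHARSETS`, `encode_row`, and `encode_table` are hypothetical names chosen for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("夜").items():
        print(label, bits, hex_string)
```

For the single character 夜 this gives `e5a49c` in UTF-8, `96e9` in CP932, and `cceb` in EUC-JP, consistent with the first bytes of the corresponding rows. Results for other characters may differ slightly from the page where its charset tables and Python's codecs disagree.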

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).