To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???岳??余??踰??堰??曄?????^ 001111110011111100111111100010100111100000111111001111111001011101011101001111110011111111100110111110100011111100111111100010011000000100111111001111111001111001000000001111110011111100111111001111110011111101011110 3f3f3f8a783f3f975d3f3fe6fa3f3f89813f3f9e403f3f3f3f3f5e
EUC-JP ???岳??余??踰??堰??曄?????^ 001111110011111100111111101100111101100100111111001111111100110110111110001111110011111111101100111111000011111100111111101100011110000100111111001111111101101110100001001111110011111100111111001111110011111101011110 3f3f3fb3d93f3fcdbe3f3fecfc3f3fb1e13f3fdba13f3f3f3f3f5e
UTF-8 琉뷸뒛岳볡쪢余됰옪踰ㅻㅊ堰섆퓫曄쎿썖流⑸콖^ 11101111101001111000110011101011101101111011100011101011100100101001101111100101101100101011001111101011101100111010000111101100101010101010001011100100101111011001100111101011100100001011000011101100100110001010101011101000101110001011000011100011100001011011101111100011100001011000101011100101101000001011000011101100100001001000011011101101100100111010101111100110100110111000010011101100100011101011111111101100100011011001011011101111101001111000101011100010100100011011100011101100101111011001011001011110 efa78cebb7b8eb929be5b2b3ebb3a1ecaaa2e4bd99eb90b0ec98aae8b8b0e385bbe3858ae5a0b0ec8486ed93abe69b84ec8ebfec8d96efa78ae291b8ecbd965e
UHC 琉뷸뒛岳볡쪢余됰옪踰ㅻㅊ堰섆퓫曄쎿썖流⑸콖^ 11101011101001001011101011100110100010101001100011100100101111111001001111100111101001011001101111100101111110011000100111101011100111101010100111101011101100101010010011101011101001001011101011100101111010001001100011100100101111111001001111100111101001011001101111100110100110111000100111101010111111001010100111101011101100011001000001011110 eba4bae68a98e4bf93e7a59be5f989eb9ea9ebb2a4eba4bae5e898e4bf93e7a59be69b89eafca9ebb1905e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)