To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 蹄??詐?瓏??迷z蹄??詐?瓏??迷zB 1001001011111011001111110011111110001101101111000011111111100000111110100011111100111111100101101100000001111010100100101111101100111111001111111000110110111100001111111110000011111010001111110011111110010110110000000111101001000010 92fb3f3f8dbc3fe0fa3f3f96c07a92fb3f3f8dbc3fe0fa3f3f96c07a42
EUC-JP 蹄??詐?瓏??迷z蹄??詐?瓏??迷zB 1100010011111101001111110011111110111010101111100011111111100000111111000011111100111111110011001100001001111010110001001111110100111111001111111011101010111110001111111110000011111100001111110011111111001100110000100111101001000010 c4fd3f3fbabe3fe0fc3f3fccc27ac4fd3f3fbabe3fe0fc3f3fccc27a42
UTF-8 蹄ㆁ렱詐렱瓏렠쇤迷z蹄ㆁ렱詐렱瓏렠쇤迷zB 111010001011100110000100111000111000011010000001111010111010000010110001111010001010100110010000111010111010000010110001111001111001001110001111111010111010000010100000111011001000011110100100111010001011111110110111011110101110100010111001100001001110001110000110100000011110101110100000101100011110100010101001100100001110101110100000101100011110011110010011100011111110101110100000101000001110110010000111101001001110100010111111101101110111101001000010 e8b984e38681eba0b1e8a990eba0b1e7938feba0a0ec87a4e8bfb77ae8b984e38681eba0b1e8a990eba0b1e7938feba0a0ec87a4e8bfb77a42
UHC 蹄ㆁ렱詐렱瓏렠쇤迷z蹄ㆁ렱詐렱瓏렠쇤迷zB 111100001011010010100100111100011000111010111110110111101111000110001110101111101101011011101010100011101011000110111100111010011101101010111011011110101111000010110100101001001111000110001110101111101101111011110001100011101011111011010110111010101000111010110001101111001110100111011010101110110111101001000010 f0b4a4f18ebedef18ebed6ea8eb1bce9dabb7af0b4a4f18ebedef18ebed6ea8eb1bce9dabb7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)