What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蹄???第詐??蹄???趙???蹄???調 10010010111110110011111100111111001111111001000111100110100011011011110000111111001111111001001011111011001111110011111100111111111001101110001000111111001111110011111110010010111110110011111100111111001111111001001010110010 92fb3f3f3f91e68dbc3f3f92fb3f3f3fe6e23f3f3f92fb3f3f3f92b2
EUC-JP 蹄???第詐??蹄???趙???蹄???調 11000100111111010011111100111111001111111100001011101000101110101011111000111111001111111100010011111101001111110011111100111111111011001110010000111111001111110011111111000100111111010011111100111111001111111100010010110100 c4fd3f3f3fc2e8babe3f3fc4fd3f3f3fece43f3f3fc4fd3f3f3fc4b4
UTF-8 蹄뀜렰렲第詐렰렣蹄뀜렰렲趙뀜렰렭蹄뀜렰렲調 111010001011100110000100111010111000000010011100111010111010000010110000111010111010000010110010111001111010110010101100111010001010100110010000111010111010000010110000111010111010000010100011111010001011100110000100111010111000000010011100111010111010000010110000111010111010000010110010111010001011011010011001111010111000000010011100111010111010000010110000111010111010000010101101111010001011100110000100111010111000000010011100111010111010000010110000111010111010000010110010111010001010101010111111 e8b984eb809ceba0b0eba0b2e7acace8a990eba0b0eba0a3e8b984eb809ceba0b0eba0b2e8b699eb809ceba0b0eba0ade8b984eb809ceba0b0eba0b2e8aabf
UHC 蹄뀜렰렲第詐렰렣蹄뀜렰렲趙뀜렰렭蹄뀜렰렲調 111100001011010010110010111100011000111010111101100011101011111111110000101011111101111011110001100011101011110110001110101101001111000010110100101100101111000110001110101111011000111010111111111100001110000110110010111100011000111010111101100011101011101011110000101101001011001011110001100011101011110110001110101111111111000011100000 f0b4b2f18ebd8ebff0afdef18ebd8eb4f0b4b2f18ebd8ebff0e1b2f18ebd8ebaf0b4b2f18ebd8ebff0e0

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
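
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed equivalents, and the helper names `encode_row` and `convert` are hypothetical. It assumes, as the `3f` bytes in the ISO-8859-1 row suggest, that any character a charset cannot represent is replaced by "?".

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("調").items():
        print(label, bits, hexstr)
```

For the single character 調, the UTF-8 entry comes out as `e8aabf`, which matches the last three bytes of the UTF-8 row above.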