To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????}v??????}vB 0011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f7d763f3f3f3f3f3f7d7642
SJIS-WIN 小?妖?舌堯}v小?妖?舌堯}vB 10001111101011000011111110010111011001000011111110010000111000111110101010011111011111010111011010001111101011000011111110010111011001000011111110010000111000111110101010011111011111010111011001000010 8fac3f97643f90e3ea9f7d768fac3f97643f90e3ea9f7d7642
EUC-JP 小?妖炤舌堯}v小?妖炤舌堯}vB 1011111010101110001111111100110111000101100011111100100111010010110000001110010111110100101000010111110101110110101111101010111000111111110011011100010110001111110010011101001011000000111001011111010010100001011111010111011001000010 beae3fcdc58fc9d2c0e5f4a17d76beae3fcdc58fc9d2c0e5f4a17d7642
UTF-8 小숞妖炤舌堯}v小숞妖炤舌堯}vB 1110010110110000100011111110110010001000100111101110010110100110100101101110011110000010101001001110100010001000100011001110010110100000101011110111110101110110111001011011000010001111111011001000100010011110111001011010011010010110111001111000001010100100111010001000100010001100111001011010000010101111011111010111011001000010 e5b08fec889ee5a696e782a4e8888ce5a0af7d76e5b08fec889ee5a696e782a4e8888ce5a0af7d7642
UHC 小숞妖炤舌堯}v小숞妖炤舌堯}vB 1110000110110011100110011111101111101000111011011110000110111111111000001101111111101000111010110111110101110110111000011011001110011001111110111110100011101101111000011011111111100000110111111110100011101011011111010111011001000010 e1b399fbe8ede1bfe0dfe8eb7d76e1b399fbe8ede1bfe0dfe8eb7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)