To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?れ?霓??癲??厭ロ??れ?霓??癲??厭ロ?B 0011111110000010111010100011111111101000101111010011111100111111111000011001111100111111001111111000100101111101100000111000110100111111001111111000001011101010001111111110100010111101001111110011111111100001100111110011111100111111100010010111110110000011100011010011111101000010 3f82ea3fe8bd3f3fe19f3f3f897d838d3f3f82ea3fe8bd3f3fe19f3f3f897d838d3f42
EUC-JP ?れ?霓??癲??厭ロ??れ?霓??癲??厭ロ?B 0011111110100100111011000011111111110000101111110011111100111111111000101010000100111111001111111011000111011110101001011110110100111111001111111010010011101100001111111111000010111111001111110011111111100010101000010011111100111111101100011101111010100101111011010011111101000010 3fa4ec3ff0bf3f3fe2a13f3fb1dea5ed3f3fa4ec3ff0bf3f3fe2a13f3fb1dea5ed3f42
UTF-8 隸れ렰霓낅맠癲딄난厭ロ늹隸れ렰霓낅맠癲딄난厭ロ늹B 11101111101001101011100011100011100000101000110011101011101000001011000011101001100111001001001111101011100000101000010111101011101001111010000011100111100110011011001011101011100101001000010011101011100000101001110011100101100011101010110111100011100000111010110111101011100010101011100111101111101001101011100011100011100000101000110011101011101000001011000011101001100111001001001111101011100000101000010111101011101001111010000011100111100110011011001011101011100101001000010011101011100000101001110011100101100011101010110111100011100000111010110111101011100010101011100101000010 efa6b8e3828ceba0b0e99c93eb8285eba7a0e799b2eb9484eb829ce58eade383adeb8ab9efa6b8e3828ceba0b0e99c93eb8285eba7a0e799b2eb9484eb829ce58eade383adeb8ab942
UHC 隸れ렰霓낅맠癲딄난厭ロ늹隸れ렰霓낅맠癲딄난厭ロ늹B 11100111111001101010101011101100100011101011110111100111111001111000010111101011100100001010110111101111101001101000101011101010101100111010110111100110111101001010101111101101100010001000001011100111111001101010101011101100100011101011110111100111111001111000010111101011100100001010110111101111101001101000101011101010101100111010110111100110111101001010101111101101100010001000001001000010 e7e6aaec8ebde7e785eb90adefa68aeab3ade6f4abed8882e7e6aaec8ebde7e785eb90adefa68aeab3ade6f4abed888242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)