To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 鵝??厭??野ν?}v鵝??厭??野ν?}vB 11101010010000000011111100111111100010010111110100111111001111111001011011101100100000111100101100111111011111010111011011101010010000000011111100111111100010010111110100111111001111111001011011101100100000111100101100111111011111010111011001000010 ea403f3f897d3f3f96ec83cb3f7d76ea403f3f897d3f3f96ec83cb3f7d7642
EUC-JP 鵝??厭??野ν?}v鵝??厭??野ν?}vB 11110011101000010011111100111111101100011101111000111111001111111100110011101110101001101100110100111111011111010111011011110011101000010011111100111111101100011101111000111111001111111100110011101110101001101100110100111111011111010111011001000010 f3a13f3fb1de3f3fcceea6cd3f7d76f3a13f3fb1de3f3fcceea6cd3f7d7642
UTF-8 鵝얜젶厭묐젒野ν븤}v鵝얜젶厭묐젒野ν븤}vB 111010011011010110011101111011001001011010011100111011001010000010110110111001011000111010101101111010111010110010010000111011001010000010010010111010011000011110001110110011101011110111101011101110001010010001111101011101101110100110110101100111011110110010010110100111001110110010100000101101101110010110001110101011011110101110101100100100001110110010100000100100101110100110000111100011101100111010111101111010111011100010100100011111010111011001000010 e9b59dec969ceca0b6e58eadebac90eca092e9878ecebdebb8a47d76e9b59dec969ceca0b6e58eadebac90eca092e9878ecebdebb8a47d7642
UHC 鵝얜젶厭묐젒野ν븤}v鵝얜젶厭묐젒野ν븤}vB 1110010010111101101111101110101110100000101010101110011011110100100100011110101110100000100100011110010110101111101001011110110110010101100011010111110101110110111001001011110110111110111010111010000010101010111001101111010010010001111010111010000010010001111001011010111110100101111011011001010110001101011111010111011001000010 e4bdbeeba0aae6f491eba091e5afa5ed958d7d76e4bdbeeba0aae6f491eba091e5afa5ed958d7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)