Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B}v??????????B}vB 001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001111101011101100011111100111111001111110011111100111111001111110011111100111111001111110011111101000010011111010111011001000010 3f3f3f3f3f3f3f3f3f3f427d763f3f3f3f3f3f3f3f3f3f427d7642
SJIS-WIN 臟棒?臧?臧????B}v臟棒?臧?臧????B}vB 1110010001100110100101100101111100111111111001000110100000111111111001000110100000111111001111110011111100111111010000100111110101110110111001000110011010010110010111110011111111100100011010000011111111100100011010000011111100111111001111110011111101000010011111010111011001000010 e466965f3fe4683fe4683f3f3f3f427d76e466965f3fe4683fe4683f3f3f3f427d7642
EUC-JP 臟棒?臧?臧?獐??B}v臟棒?臧?臧?獐??B}vB 111001111100011111001011110000000011111111100111110010010011111111100111110010010011111110001111110010111011101000111111001111110100001001111101011101101110011111000111110010111100000000111111111001111100100100111111111001111100100100111111100011111100101110111010001111110011111101000010011111010111011001000010 e7c7cbc03fe7c93fe7c93f8fcbba3f3f427d76e7c7cbc03fe7c93fe7c93f8fcbba3f3f427d7642
UTF-8 臟棒툽臧렔臧렎獐쇨텼B}v臟棒툽臧렔臧렎獐쇨텼B}vB 11101000100001111001111111100110101000111001001011101101100010001011110111101000100001111010011111101011101000001001010011101000100001111010011111101011101000001000111011100111100011011001000011101100100001111010100011101101100001011011110001000010011111010111011011101000100001111001111111100110101000111001001011101101100010001011110111101000100001111010011111101011101000001001010011101000100001111010011111101011101000001000111011100111100011011001000011101100100001111010100011101101100001011011110001000010011111010111011001000010 e8879fe6a392ed88bde887a7eba094e887a7eba08ee78d90ec87a8ed85bc427d76e8879fe6a392ed88bde887a7eba094e887a7eba08ee78d90ec87a8ed85bc427d7642
UHC 臟棒툽臧렔臧렎獐쇨텼B}v臟棒툽臧렔臧렎獐쇨텼B}vB 1110110111110100110111001110101011000101111110101110110111110101100011101010100111101101111101011000111010100100111011011110111110111100111010101100010111100001010000100111110101110110111011011111010011011100111010101100010111111010111011011111010110001110101010011110110111110101100011101010010011101101111011111011110011101010110001011110000101000010011111010111011001000010 edf4dceac5faedf58ea9edf58ea4edefbceac5e1427d76edf4dceac5faedf58ea9edf58ea4edefbceac5e1427d7642

SJIS-Win, EUC-JP: Classic charsets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
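
The kind of conversion shown in the table can be approximated with a minimal Python sketch. The codec names below (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to the page's charset labels, and `encode_row` and `encode_table` are hypothetical helper names, not part of this tool. With `errors="replace"`, Python substitutes `?` (0x3f) for characters a charset cannot represent. This resembles the `3f` bytes in the table, although the page does not state how its own conversion handles such characters.

```python
# Page label -> Python codec name. These mappings are assumptions,
# not something the page itself documents.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Characters the codec cannot represent become "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("棒B}v").items():
        print(label, binary, hexadecimal)
```

The ASCII characters `B}v` encode to the same bytes (`427d76`) in all five charsets, while `棒` yields different bytes in each charset that supports it, matching the `965f`, `cbc0` and `e6a392` sequences visible in the SJIS-WIN, EUC-JP and UTF-8 rows.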