To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 鴦?????鷹??v鴦?????鷹??vB 11101001111100010011111100111111001111110011111100111111100100011110100100111111001111110111011011101001111100010011111100111111001111110011111100111111100100011110100100111111001111110111011001000010 e9f13f3f3f3f3f91e93f3f76e9f13f3f3f3f3f91e93f3f7642
EUC-JP 鴦??馹??鷹??v鴦??馹??鷹??vB 1111001011110011001111110011111110001111111010011010000100111111001111111100001011101011001111110011111101110110111100101111001100111111001111111000111111101001101000010011111100111111110000101110101100111111001111110111011001000010 f2f33f3f8fe9a13f3fc2eb3f3f76f2f33f3f8fe9a13f3fc2eb3f3f7642
UTF-8 鴦볛뮩馹긺뼨鷹곴텣v鴦볛뮩馹긺뼨鷹곴텣vB 111010011011010010100110111010111011001110011011111010111010111010101001111010011010011010111001111010101011100010111010111010111011110010101000111010011011011110111001111010101011001110110100111011011000010110100011011101101110100110110100101001101110101110110011100110111110101110101110101010011110100110100110101110011110101010111000101110101110101110111100101010001110100110110111101110011110101010110011101101001110110110000101101000110111011001000010 e9b4a6ebb39bebaea9e9a6b9eab8baebbca8e9b7b9eab3b4ed85a376e9b4a6ebb39bebaea9e9a6b9eab8baebbca8e9b7b9eab3b4ed85a37642
UHC 鴦볛뮩馹긺뼨鷹곴텣v鴦볛뮩馹긺뼨鷹곴텣vB 111001001110110010010011111000101001001010110011111011001111000110110001111001111001011010101011111010111110110110000001111010101011011010011000011101101110010011101100100100111110001010010010101100111110110011110001101100011110011110010110101010111110101111101101100000011110101010110110100110000111011001000010 e4ec93e292b3ecf1b1e796abebed81eab69876e4ec93e292b3ecf1b1e796abebed81eab6987642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)