To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 蒸?貞???屠乙?v蒸?貞???屠乙?vB 1000111111110110001111111001001011100101001111110011111100111111100100110110101010001001101100110011111101110110100011111111011000111111100100101110010100111111001111110011111110010011011010101000100110110011001111110111011001000010 8ff63f92e53f3f3f936a89b33f768ff63f92e53f3f3f936a89b33f7642
EUC-JP 蒸?貞???屠乙?v蒸?貞???屠乙?vB 1011111011111000001111111100010011100111001111110011111100111111110001011100101110110010101101010011111101110110101111101111100000111111110001001110011100111111001111110011111111000101110010111011001010110101001111110111011001000010 bef83fc4e73f3f3fc5cbb2b53f76bef83fc4e73f3f3fc5cbb2b53f7642
UTF-8 蒸렧貞肋렰뀀屠乙렏v蒸렧貞肋렰뀀屠乙렏vB 111010001001001010111000111010111010000010100111111010001011001010011110111011111010010110010011111010111010000010110000111010111000000010000000111001011011000110100000111001001011100110011001111010111010000010001111011101101110100010010010101110001110101110100000101001111110100010110010100111101110111110100101100100111110101110100000101100001110101110000000100000001110010110110001101000001110010010111001100110011110101110100000100011110111011001000010 e892b8eba0a7e8b29eefa593eba0b0eb8080e5b1a0e4b999eba08f76e892b8eba0a7e8b29eefa593eba0b0eb8080e5b1a0e4b999eba08f7642
UHC 蒸렧貞肋렰뀀屠乙렏v蒸렧貞肋렰뀀屠乙렏vB 111100011111101010001110101101101110111111110110110100101111000110001110101111011011001011101011110100111111010111101011111000001000111010100101011101101111000111111010100011101011011011101111111101101101001011110001100011101011110110110010111010111101001111110101111010111110000010001110101001010111011001000010 f1fa8eb6eff6d2f18ebdb2ebd3f5ebe08ea576f1fa8eb6eff6d2f18ebdb2ebd3f5ebe08ea57642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)