Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 嚥?????衍??}v嚥?????衍??}vB 100110101000101100111111001111110011111100111111001111111001111110100101001111110011111101111101011101101001101010001011001111110011111100111111001111110011111110011111101001010011111100111111011111010111011001000010 9a8b3f3f3f3f3f9fa53f3f7d769a8b3f3f3f3f3f9fa53f3f7d7642
EUC-JP 嚥?????衍??}v嚥?????衍??}vB 110100111110101100111111001111110011111100111111001111111101111010100111001111110011111101111101011101101101001111101011001111110011111100111111001111110011111111011110101001110011111100111111011111010111011001000010 d3eb3f3f3f3f3fdea73f3f7d76d3eb3f3f3f3f3fdea73f3f7d7642
UTF-8 嚥득굢歷쇘댚衍꾢뤀}v嚥득굢歷쇘댚衍꾢뤀}vB 1110010110011010101001011110101110010011100111011110101010110101101000101110111110100110100011001110110010000111100110001110101110001100100110101110100010100001100011011110101010111110101000101110101110100100100000000111110101110110111001011001101010100101111010111001001110011101111010101011010110100010111011111010011010001100111011001000011110011000111010111000110010011010111010001010000110001101111010101011111010100010111010111010010010000000011111010111011001000010 e59aa5eb939deab5a2efa68cec8798eb8c9ae8a18deabea2eba4807d76e59aa5eb939deab5a2efa68cec8798eb8c9ae8a18deabea2eba4807d7642
UHC 嚥득굢歷쇘댚衍꾢뤀}v嚥득굢歷쇘댚衍꾢뤀}vB 1110011010111111101101011110011010000010100010011110011010111000101111001110011110001000101111101110011011100010100001001110010110001111101100010111110101110110111001101011111110110101111001101000001010001001111001101011100010111100111001111000100010111110111001101110001010000100111001011000111110110001011111010111011001000010 e6bfb5e68289e6b8bce788bee6e284e58fb17d76e6bfb5e68289e6b8bce788bee6e284e58fb17d7642
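
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper name `encode_all` are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (byte 3f), which is consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("あ}vB").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.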

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).