To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 嚥〓?誼ゆ?喩?6}v嚥〓?誼ゆ?喩?6}vB 1001101010001011100000011010110000111111100010110110001010000010111001000011111110011010011001110011111110000010010101010111110101110110100110101000101110000001101011000011111110001011011000101000001011100100001111111001101001100111001111111000001001010101011111010111011001000010 9a8b81ac3f8b6282e43f9a673f82557d769a8b81ac3f8b6282e43f9a673f82557d7642
EUC-JP 嚥〓?誼ゆ?喩?6}v嚥〓?誼ゆ?喩?6}vB 1101001111101011101000101010111000111111101101011100001110100100111001100011111111010011110010000011111110100011101101100111110101110110110100111110101110100010101011100011111110110101110000111010010011100110001111111101001111001000001111111010001110110110011111010111011001000010 d3eba2ae3fb5c3a4e63fd3c83fa3b67d76d3eba2ae3fb5c3a4e63fd3c83fa3b67d7642
UTF-8 嚥〓맧誼ゆ에喩뽰6}v嚥〓맧誼ゆ에喩뽰6}vB 1110010110011010101001011110001110000000100100111110101110100111101001111110100010101010101111001110001110000010100001101110110010010111100100001110010110010110101010011110101110111101101100001110111110111100100101100111110101110110111001011001101010100101111000111000000010010011111010111010011110100111111010001010101010111100111000111000001010000110111011001001011110010000111001011001011010101001111010111011110110110000111011111011110010010110011111010111011001000010 e59aa5e38093eba7a7e8aabce38286ec9790e596a9ebbdb0efbc967d76e59aa5e38093eba7a7e8aabce38286ec9790e596a9ebbdb0efbc967d7642
UHC 嚥〓맧誼ゆ에喩뽰6}v嚥〓맧誼ゆ에喩뽰6}vB 1110011010111111101000011110101110010000101100001110101111111110101010101110011010111111101000011110101011100111100101101110110010100011101101100111110101110110111001101011111110100001111010111001000010110000111010111111111010101010111001101011111110100001111010101110011110010110111011001010001110110110011111010111011001000010 e6bfa1eb90b0ebfeaae6bfa1eae796eca3b67d76e6bfa1eb90b0ebfeaae6bfa1eae796eca3b67d7642

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent appears in the table as "?" (byte 3f).