To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 維?し維??維?ぜ}v維?し維??維?ぜ}vB 100010001101101100111111100000101011010110001000110110110011111100111111100010001101101100111111100000101011101001111101011101101000100011011011001111111000001010110101100010001101101100111111001111111000100011011011001111111000001010111010011111010111011001000010 88db3f82b588db3f3f88db3f82ba7d7688db3f82b588db3f3f88db3f82ba7d7642
EUC-JP 維?し維??維?ぜ}v維?し維??維?ぜ}vB 101100001101110100111111101001001011011110110000110111010011111100111111101100001101110100111111101001001011110001111101011101101011000011011101001111111010010010110111101100001101110100111111001111111011000011011101001111111010010010111100011111010111011001000010 b0dd3fa4b7b0dd3f3fb0dd3fa4bc7d76b0dd3fa4b7b0dd3f3fb0dd3fa4bc7d7642
UTF-8 維껊し維귣쳡維귣ぜ}v維껊し維귣쳡維귣ぜ}vB 1110011110110110101011011110101010111011100010101110001110000001100101111110011110110110101011011110101010110111101000111110110010110011101000011110011110110110101011011110101010110111101000111110001110000001100111000111110101110110111001111011011010101101111010101011101110001010111000111000000110010111111001111011011010101101111010101011011110100011111011001011001110100001111001111011011010101101111010101011011110100011111000111000000110011100011111010111011001000010 e7b6adeabb8ae38197e7b6adeab7a3ecb3a1e7b6adeab7a3e3819c7d76e7b6adeabb8ae38197e7b6adeab7a3ecb3a1e7b6adeab7a3e3819c7d7642
UHC 維껊し維귣쳡維귣ぜ}v維껊し維귣쳡維귣ぜ}vB 1110101110101011100000111110101110101010101101111110101110101011100000101110101110101011100001111110101110101011100000101110101110101010101111000111110101110110111010111010101110000011111010111010101010110111111010111010101110000010111010111010101110000111111010111010101110000010111010111010101010111100011111010111011001000010 ebab83ebaab7ebab82ebab87ebab82ebaabc7d76ebab83ebaab7ebab82ebab87ebab82ebaabc7d7642
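
The rows above can be approximated with a short Python sketch. It assumes that Python's standard codecs `cp932`, `euc_jp` and `cp949` correspond to the page's SJIS-WIN, EUC-JP and UHC labels, and that characters a charset cannot represent are replaced with "?" (0x3f), as the ISO-8859-1 row suggests. The names `CODECS`, `encode_row` and `encode_table` are hypothetical; this is not the converter's actual implementation.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the conversion.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


# Print only the ASCII-safe columns so the output works on any terminal.
for label, (_, bits, hexstr) in encode_table("し}vB").items():
    print(label, bits, hexstr)
```

Decoding the encoded bytes again is a design choice for illustration: it makes the "Character" column show the "?" substitutions rather than the original input.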

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).