What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 箋??歪??鶯??耳??癲??維??勇??^ 1110001010110011001111110011111110011000011000110011111100111111111010011111001000111111001111111000111010101000001111110011111111100001100111110011111100111111100010001101101100111111001111111001011101000101001111110011111101011110 e2b33f3f98633f3fe9f23f3f8ea83f3fe19f3f3f88db3f3f97453f3f5e
EUC-JP 箋??歪??鶯??耳??癲??維??勇??^ 1110010010110101001111110011111111001111110001000011111100111111111100101111010000111111001111111011110010101010001111110011111111100010101000010011111100111111101100001101110100111111001111111100110110100110001111110011111101011110 e4b53f3fcfc43f3ff2f43f3fbcaa3f3fe2a13f3fb0dd3f3fcda63f3f5e
UTF-8 箋덌쫭歪뺣쳞鶯뺡툣耳놅쮫癲낂굲維꾢짆勇싳펵^ 11100111101011101000101111101011100011011000110011101100101010111010110111100110101011011010101011101011101110101010001111101100101100111001111011101001101101101010111111101011101110101010000111101101100010001010001111101000100000001011001111101011100001101000010111101100101011101010101111100111100110011011001011101011100000101000001011101010101101011011001011100111101101101010110111101010101111101010001011101100101001111000011011100101100010111000011111101100100010111011001111101101100011101011010101011110 e7ae8beb8d8cecabade6adaaebbaa3ecb39ee9b6afebbaa1ed88a3e880b3eb8685ecaeabe799b2eb8282eab5b2e7b6adeabea2eca786e58b87ec8bb3ed8eb55e
UHC 箋덌쫭歪뺣쳞鶯뺡툣耳놅쮫癲낂굲維꾢짆勇싳펵^ 11101111101010001000100011101111101001101000010111101000111000001001010111101011101010111000010011100101101000111001010111101001101110001001101011101100101111001000011011101111101010001000100011101111101001101000010111101001100000101001010111101011101010111000010011100101101000111001010111101001101110001001101011101100101111001000011001011110 efa888efa685e8e095ebab84e5a395e9b89aecbc86efa888efa685e98295ebab84e5a395e9b89aecbc865e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
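
The same kind of conversion can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels above to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one table row per charset for a sample input.
    for label, (binary, hexadecimal) in convert("あ^").items():
        print(label, binary, hexadecimal)
```

Under these assumptions, `encode_row("^", "utf-8")` gives `("01011110", "5e")`, matching the final byte of every row in the table.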