What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 澱???殿???祭??澱???殿???祭?甄^ 100100110110001000111111001111110011111110010011011000010011111100111111001111111000110111010101001111110011111110010011011000100011111100111111001111111001001101100001001111110011111100111111100011011101010100111111111000010100101001011110 93623f3f3f93613f3f3f8dd53f3f93623f3f3f93613f3f3f8dd53fe14a5e
EUC-JP 澱???殿???祭??澱???殿???祭?甄^ 110001011100001100111111001111110011111111000101110000100011111100111111001111111011101011010111001111110011111111000101110000110011111100111111001111111100010111000010001111110011111100111111101110101101011100111111111000011010101101011110 c5c33f3f3fc5c23f3f3fbad73f3fc5c33f3f3fc5c23f3f3fbad73fe1ab5e
UTF-8 澱ㆁ렰렕殿골렰렧祭재섞澱ㆁ렰렕殿골렰렧祭잼甄^ 11100110101111101011000111100011100001101000000111101011101000001011000011101011101000001001010111100110101011101011111111101010101100111010100011101011101000001011000011101011101000001010011111100111101001011010110111101100100111101010110011101100100001001001111011100110101111101011000111100011100001101000000111101011101000001011000011101011101000001001010111100110101011101011111111101010101100111010100011101011101000001011000011101011101000001010011111100111101001011010110111101100100111101011110011100111100101001000010001011110 e6beb1e38681eba0b0eba095e6aebfeab3a8eba0b0eba0a7e7a5adec9eacec849ee6beb1e38681eba0b0eba095e6aebfeab3a8eba0b0eba0a7e7a5adec9ebce794845e
UHC 澱ㆁ렰렕殿골렰렧祭재섞澱ㆁ렰렕殿골렰렧祭잼甄^ 111011101111111010100100111100011000111010111101100011101010101011101110111111001011000011110001100011101011110110001110101101101111000010101110110000001110011110111100101011111110111011111110101001001111000110001110101111011000111010101010111011101111110010110000111100011000111010111101100011101011011011110000101011101100000011101011110011001011010001011110 eefea4f18ebd8eaaeefcb0f18ebd8eb6f0aec0e7bcafeefea4f18ebd8eaaeefcb0f18ebd8eb6f0aec0ebccb45e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
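
A conversion like the one in the table can be approximated with a short Python sketch. This is not the code behind this page. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) is an assumption, and the helper names `to_bits` and `encode_rows` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which appears to match the 3f bytes in the ISO-8859-1 row above.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are assumptions, not taken from the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Return the bytes as one string of 0s and 1s, eight bits per byte."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_rows(text: str):
    """Encode text in each charset; return (label, binary, hex) tuples."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        rows.append((label, to_bits(data), data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("澱^"):
        print(label, bits, hex_string)
```

Because the exact conversion tables of the original tool are not known, rows for characters outside ASCII may differ slightly from the table above, especially for vendor-specific extensions in CP932 and UHC.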