What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦???孃??誼??恂с????魏??汲陰 1110100111110001001111110011111100111111100110110110111100111111001111111000101101100010001111110011111110011100100101101000010010000011001111110011111100111111001111111110100110110000001111110011111110001011100000101000100101000001 e9f13f3f3f9b6f3f3f8b623f3f9c9684833f3f3f3fe9b03f3f8b828941
EUC-JP 鴦???孃??誼??恂с????魏??汲陰 1111001011110011001111110011111100111111110101011101000000111111001111111011010111000011001111110011111111010111111101101010011111100011001111110011111100111111001111111111001010110010001111110011111110110101111000101011000110100010 f2f33f3f3fd5d03f3fb5c33f3fd7f6a7e33f3f3f3ff2b23f3fb5e2b1a2
UTF-8 鴦꾆뀀쳳孃뉖뛽誼붹윍恂с돩嶺뚮벉魏꾬㎠汲陰 1110100110110100101001101110101010111110100001101110101110000000100000001110110010110011101100111110010110101101100000111110101110001001100101101110101110011011101111011110100010101010101111001110101110110110101110011110110010011100100011011110011010000001100000101101000110000001111010111000111110101001111011111010011010101011111010111001101010101110111010111011001010001001111010011010110110001111111010101011111010101100111000111000111010100000111001101011000110110010111010011001100110110000 e9b4a6eabe86eb8080ecb3b3e5ad83eb8996eb9bbde8aabcebb6b9ec9c8de68182d181eb8fa9efa6abeb9aaeebb289e9ad8feabeace38ea0e6b1b2e999b0
UHC 鴦꾆뀀쳳孃뉖뛽誼붹윍恂с돩嶺뚮벉魏꾬㎠汲陰 111001001110110010000100110011101011001011101011101010111001011011100101101111101000011111101011100011011000001111101011111111101001010011100110100111111001010011100010111000011010110011100011100010011010110011100111101011011000110011101011100100111010110011101010111000001000010011101111101001111011001011010000111000111110101111100100 e4ec84ceb2ebab96e5be87eb8d83ebfe94e69f94e2e1ace389ace7ad8ceb93aceae084efa7b2d0e3ebe4

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
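
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the tool's actual implementation. The codec names are assumptions: `cp932` stands in for SJIS-WIN and `cp949` for UHC. The `errors="replace"` setting is also an assumption, chosen because characters that a charset cannot represent appear in the table as `?` (hex `3f`). The function name `encode_table` is hypothetical.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, round-tripped text, binary string, hex string) rows."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters are replaced with "?" (0x3f).
        data = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = data.decode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hex_string in encode_table("陰"):
        print(label, shown, bits, hex_string)
```

For the single character 陰, this should give `3f` for ISO-8859-1, `8941` for SJIS-WIN, `b1a2` for EUC-JP, and `e999b0` for UTF-8, matching the last bytes of the corresponding rows in the table.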