To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ゆ??цぐ娃??藥??岳??耶??藥?? 100110101100100010000010111001000011111100111111100001001000100010000010101011101000100010100001001111110011111111100101010110100011111100111111100010100111100000111111001111111001011011101011001111110011111111100101010110100011111100111111 9ac882e43f3f848882ae88a13f3fe55a3f3f8a783f3f96eb3f3fe55a3f3f
EUC-JP 塋ゆ??цぐ娃??藥??岳??耶??藥?? 110101001100101010100100111001100011111100111111101001111110100010100100101100001011000010100011001111110011111111101001101110110011111100111111101100111101100100111111001111111100110011101101001111110011111111101001101110110011111100111111 d4caa4e63f3fa7e8a4b0b0a33f3fe9bb3f3fb3d93f3fcced3f3fe9bb3f3f
UTF-8 塋ゆ춲歷цぐ娃쒎뜵藥썹궘岳껇쒼耶섉릫藥썸씇 1110010110100001100010111110001110000010100001101110110010110110101100101110111110100110100011001101000110000110111000111000000110010000111001011010100010000011111011001001001010001110111010111001110010110101111010001001011110100101111011001000110110111001111010101011011010011000111001011011001010110011111010101011101110000111111011001001001010111100111010001000000010110110111011001000010010001001111010111010011010101011111010001001011110100101111011001000110110111000111011001001010010000111 e5a18be38286ecb6b2efa68cd186e38190e5a883ec928eeb9cb5e897a5ec8db9eab698e5b2b3eabb87ec92bce880b6ec8489eba6abe897a5ec8db8ec9487
UHC 塋ゆ춲歷цぐ娃쒎뜵藥썹궘岳껇쒼耶섉릫藥썸씇 111001111010101110101010111001101010110110001110111001101011100010101100111010001010101010110000111010001101111110011100111001011000110110110011111001011011011110111101111001111000001010101101111001001011111110000011111010001011111010110000111001011010110110011000111001101001000010001101111001011011011110111101111001101001110110011111 e7abaae6ad8ee6b8ace8aab0e8df9ce58db3e5b7bde782ade4bf83e8beb0e5ad98e6908de5b7bde69d9f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)