To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ゆ?倭??泳??瘟??秧??泳??瘟?? 1001101011001000100000101110010000111111100110000110000000111111001111111000100101101010001111110011111111100001100010010011111100111111111000100101111000111111001111111000100101101010001111110011111111100001100010010011111100111111 9ac882e43f98603f3f896a3f3fe1893f3fe25e3f3f896a3f3fe1893f3f
EUC-JP 塋ゆ?倭??泳??瘟??秧??泳??瘟?? 1101010011001010101001001110011000111111110011111100000100111111001111111011000111001011001111110011111111100001111010010011111100111111111000111011111100111111001111111011000111001011001111110011111111100001111010010011111100111111 d4caa4e63fcfc13f3fb1cb3f3fe1e93f3fe3bf3f3fb1cb3f3fe1e93f3f
UTF-8 塋ゆ뜆倭졿짎泳싨룂瘟룡뿈秧녘꽦泳싪큹瘟룝퓘 111001011010000110001011111000111000001010000110111010111001110010000110111001011000000010101101111011001010000110111111111011001010011110001110111001101011001110110011111011001000101110101000111010111010001110000010111001111001100010011111111010111010001110100001111010111011111110001000111001111010011110100111111010111000010110011000111010101011110110100110111001101011001110110011111011001000101110101010111011011000000110111001111001111001100010011111111010111010001110011101111011011001001110011000 e5a18be38286eb9c86e580adeca1bfeca78ee6b3b3ec8ba8eba382e7989feba3a1ebbf88e7a7a7eb8598eabda6e6b3b3ec8baaed81b9e7989feba39ded9398
UHC 塋ゆ뜆倭졿짎泳싨룂瘟룡뿈秧녘꽦泳싪큹瘟룝퓘 111001111010101110101010111001101000110110001001111010001101111010100000111001101010001110011010111001111011011010011010111001101000111110000011111010001011000010110111111001101001011110001111111001001110101110110011111010001000010010110001111001111011011010011010111010001011010010001000111010001011000010110111111001001011111110000011 e7abaae68d89e8dea0e6a39ae7b69ae68f83e8b0b7e6978fe4ebb3e884b1e7b69ae8b488e8b0b7e4bf83

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)