To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??泣??碎??嚥〓?逾??膺??曖 100110100110101000111111001111111000101110000011001111110011111111100001111010100011111100111111100110101000101110000001101011000011111111100111101001010011111100111111111001000101111000111111001111111001111001000010 9a6a3f3f8b833f3fe1ea3f3f9a8b81ac3fe7a53f3fe45e3f3f9e42
EUC-JP 嗚??泣??碎??嚥〓?逾??膺??曖 110100111100101100111111001111111011010111100011001111110011111111100010111011000011111100111111110100111110101110100010101011100011111111101110101001110011111100111111111001111011111100111111001111111101101110100011 d3cb3f3fb5e33f3fe2ec3f3fd3eba2ae3feea73f3fe7bf3f3fdba3
UTF-8 嗚삠굦泣쒙쭏碎띔괵嚥〓뀈逾띄뛾膺쇰짋曖 111001011001011110011010111011001000001010100000111010101011010110100110111001101011001110100011111011001001001010011001111011001010110110001111111001111010001010001110111010111001110110010100111010101011010010110101111001011001101010100101111000111000000010010011111010111000000010001000111010011000000010111110111010111001110110000100111010111001101110111110111010001000011010111010111011001000011110110000111011001010011110001011111001101001101110010110 e5979aec82a0eab5a6e6b3a3ec9299ecad8fe7a28eeb9d94eab4b5e59aa5e38093eb8088e980beeb9d84eb9bbee886baec87b0eca78be69b96
UHC 嗚삠굦泣쒙쭏碎띔괵嚥〓뀈逾띄뛾膺쇰짋曖 1110011111110000101110111110001110000010100011001110101111101000100111001110111110100111100010001110000111101111101101101110101010110001101011001110011010111111101000011110101110000101100001001110101110110101101101101110011110001101100001001110101111101100101111001110101110100011100101111110010011110010 e7f0bbe3828cebe89cefa788e1efb6eab1ace6bfa1eb8584ebb5b6e78d84ebecbceba397e4f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)