To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????U??????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f553f3f3f3f3f3f3f
SJIS-WIN 馭??乙?????音??U馭??乙??? 11101001011001100011111100111111100010011011001100111111001111110011111100111111001111111000100110111001001111110011111101010101111010010110011000111111001111111000100110110011001111110011111100111111 e9663f3f89b33f3f3f3f3f89b93f3f55e9663f3f89b33f3f3f
EUC-JP 馭??乙??荑??音??U馭??乙??荑 1111000111000111001111110011111110110010101101010011111100111111100011111101011111111001001111110011111110110010101110110011111100111111010101011111000111000111001111110011111110110010101101010011111100111111100011111101011111111001 f1c73f3fb2b53f3f8fd7f93f3fb2bb3f3f55f1c73f3fb2b53f3f8fd7f9
UTF-8 馭곷벏乙뤸걖荑볩㎕音녹뒆U馭곷벏乙뤸걖荑 11101001101001101010110111101010101100111011011111101011101100101000111111100100101110011001100111101011101001001011100011101010101100011001011011101000100011011001000111101011101100111010100111100011100011101001010111101001100111111011001111101011100001011011100111101011100100101000011001010101111010011010011010101101111010101011001110110111111010111011001010001111111001001011100110011001111010111010010010111000111010101011000110010110111010001000110110010001 e9a6adeab3b7ebb28fe4b999eba4b8eab196e88d91ebb3a9e38e95e99fb3eb85b9eb928655e9a6adeab3b7ebb28fe4b999eba4b8eab196e88d91
UHC 馭곷벏乙뤸걖荑볩㎕音녹뒆U馭곷벏乙뤸걖荑 111001011101111110000001111010111001001110101111111010111110000010001111111001101000000110000001111011001011111110010011111011111010011110100001111010111110010110110011111011001000101010000100010101011110010111011111100000011110101110010011101011111110101111100000100011111110011010000001100000011110110010111111 e5df81eb93afebe08fe68181ecbf93efa7a1ebe5b3ec8a8455e5df81eb93afebe08fe68181ecbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)