To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鏑?昱??孟?調?姐?蜘昱??孟?調?姐 100100110100110000111111111110100110001100111111001111111001011011010000001111111001001010110010001111111000100010110111001111111001001001110111111110100110001100111111001111111001011011010000001111111001001010110010001111111000100010110111 934c3ffa633f3f96d03f92b23f88b73f9277fa633f3f96d03f92b23f88b7
EUC-JP 鏑?昱??孟?調?姐?蜘昱??孟?調?姐 1100010110101101001111111000111111000010101011010011111100111111110011001101001000111111110001001011010000111111101100001011100100111111110000111101100010001111110000101010110100111111001111111100110011010010001111111100010010110100001111111011000010111001 c5ad3f8fc2ad3f3fccd23fc4b43fb0b93fc3d88fc2ad3f3fccd23fc4b43fb0b9
UTF-8 鏑렫昱겻롛孟웃調렪姐븅蜘昱겻롛孟웃調렪姐 111010011000111110010001111010111010000010101011111001101001100010110001111010101011001010111011111010111010000110011011111001011010110110011111111011001001101110000011111010001010101010111111111010111010000010101010111001011010011110010000111010111011100010000101111010001001110010011000111001101001100010110001111010101011001010111011111010111010000110011011111001011010110110011111111011001001101110000011111010001010101010111111111010111010000010101010111001011010011110010000 e98f91eba0abe698b1eab2bbeba19be5ad9fec9b83e8aabfeba0aae5a790ebb885e89c98e698b1eab2bbeba19be5ad9fec9b83e8aabfeba0aae5a790
UHC 鏑렫昱겻롛孟웃調렪姐븅蜘昱겻롛孟웃調렪姐 11101110111010111000111010111001111010011111000010110000111001001000111011011111110110001110101110111111111101001111000011100000100011101011100011101110101110111011101011101001111100101011101111101001111100001011000011100100100011101101111111011000111010111011111111110100111100001110000010001110101110001110111010111011 eeeb8eb9e9f0b0e48edfd8ebbff4f0e08eb8eebbbae9f2bbe9f0b0e48edfd8ebbff4f0e08eb8eebb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)