To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 筌??乙??儒??馭??乙?????鸚??h 11100010101000110011111100111111100010011011001100111111001111111000111011110010001111110011111111101001011001100011111100111111100010011011001100111111001111110011111100111111001111111110101001011111001111110011111101101000 e2a33f3f89b33f3f8ef23f3fe9663f3f89b33f3f3f3f3fea5f3f3f68
EUC-JP 筌??乙??儒??馭??乙?????鸚??h 11100100101001010011111100111111101100101011010100111111001111111011110011110100001111110011111111110001110001110011111100111111101100101011010100111111001111110011111100111111001111111111001111000000001111110011111101101000 e4a53f3fb2b53f3fbcf43f3ff1c73f3fb2b53f3f3f3f3ff3c03f3f68
UTF-8 筌㏂끋乙쎿걖儒뱀퐧馭곷툝乙억㎖栒쎌굷鸚룸뿿h 11100111101011011000110011100011100011111000001011101011100000011000101111100100101110011001100111101100100011101011111111101010101100011001011011100101100001001001001011101011101100011000000011101101100100001010011111101001101001101010110111101010101100111011011111101101100010001001110111100100101110011001100111101100100101101011010111100011100011101001011011100110101000001001001011101100100011101000110011101010101101011011011111101001101110001001101011101011101000111011100011101011101111111011111101101000 e7ad8ce38f82eb818be4b999ec8ebfeab196e58492ebb180ed90a7e9a6adeab3b7ed889de4b999ec96b5e38e96e6a092ec8e8ceab5b7e9b89aeba3b8ebbfbf68
UHC 筌㏂끋乙쎿걖儒뱀퐧馭곷툝乙억㎖栒쎌굷鸚룸뿿h 11101111101001111010001011100011100001011011110111101011111000001001101111100110100000011000000111101010111000111011100111101100101111011001000011100101110111111000000111101011101110001001010011101011111000001011111011101111101001111010001011100010111000111011110111101100100000101001011011100101101001001011011111101011100101111011111101101000 efa7a2e385bdebe09be68181eae3b9ecbd90e5df81ebb894ebe0beefa7a2e2e3bdec8296e5a4b7eb97bf68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)