What bit string is a character (or string of characters) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??吟??銀ъ?馭??乙?????筌??B 1110001010100011001111110011111110001011111000010011111100111111100010111110001010000100100011000011111111101001011001100011111100111111100010011011001100111111001111110011111100111111001111111110001010100011001111110011111101000010 e2a33f3f8be13f3f8be2848c3fe9663f3f89b33f3f3f3f3fe2a33f3f42
EUC-JP 筌??吟??銀ъ?馭??乙?????筌??B 1110010010100101001111110011111110110110111000110011111100111111101101101110010010100111111011000011111111110001110001110011111100111111101100101011010100111111001111110011111100111111001111111110010010100101001111110011111101000010 e4a53f3fb6e33f3fb6e4a7ec3ff1c73f3fb2b53f3f3f3f3fe4a53f3f42
UTF-8 筌㏂끋吟꾣걖銀ъ퐧馭곷벏乙싷㎖栒쎌굮筌욌퍌B 111001111010110110001100111000111000111110000010111010111000000110001011111001011001000010011111111010101011111010100011111010101011000110010110111010011000101010000000110100011000101011101101100100001010011111101001101001101010110111101010101100111011011111101011101100101000111111100100101110011001100111101100100010111011011111100011100011101001011011100110101000001001001011101100100011101000110011101010101101011010111011100111101011011000110011101100100110101000110011101101100011011000110001000010 e7ad8ce38f82eb818be5909feabea3eab196e98a80d18aed90a7e9a6adeab3b7ebb28fe4b999ec8bb7e38e96e6a092ec8e8ceab5aee7ad8cec9a8ced8d8c42
UHC 筌㏂끋吟꾣걖銀ъ퐧馭곷벏乙싷㎖栒쎌굮筌욌퍌B 11101111101001111010001011100011100001011011110111101011111000011000010011100110100000011000000111101011110111101010110011101100101111011001000011100101110111111000000111101011100100111010111111101011111000001001101011101111101001111010001011100010111000111011110111101100100000101001001011101111101001111001111011101011101110111000001101000010 efa7a2e385bdebe184e68181ebdeacecbd90e5df81eb93afebe09aefa7a2e2e3bdec8292efa79eebbb8342

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
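
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. The `CODECS` mapping from the table's charset labels to Python codec names is an assumption (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and `to_bits` and `convert` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears consistent with the runs of 3f in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def to_bits(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あB").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.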