What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??乙??純??馭??吟?????筌??B 11100010101000110011111100111111100010011011001100111111001111111000111110000011001111110011111111101001011001100011111100111111100010111110000100111111001111110011111100111111001111111110001010100011001111110011111101000010 e2a33f3f89b33f3f8f833f3fe9663f3f8be13f3f3f3f3fe2a33f3f42
EUC-JP 筌??乙??純??馭??吟?????筌??B 11100100101001010011111100111111101100101011010100111111001111111011110111100011001111110011111111110001110001110011111100111111101101101110001100111111001111110011111100111111001111111110010010100101001111110011111101000010 e4a53f3fb2b53f3fbde33f3ff1c73f3fb6e33f3f3f3f3fe4a53f3f42
UTF-8 筌㏂끋乙멩걖純껉킅馭곣뫂吟랃㎖栒쎌궅筌욎ㅃB 11100111101011011000110011100011100011111000001011101011100000011000101111100100101110011001100111101011101010011010100111101010101100011001011011100111101101001001010011101010101110111000100111101101100000101000010111101001101001101010110111101010101100111010001111101011101010111000001011100101100100001001111111101011100111101000001111100011100011101001011011100110101000001001001011101100100011101000110011101010101101101000010111100111101011011000110011101100100110101000111011100011100001011000001101000010 e7ad8ce38f82eb818be4b999eba9a9eab196e7b494eabb89ed8285e9a6adeab3a3ebab82e5909feb9e83e38e96e6a092ec8e8ceab685e7ad8cec9a8ee3858342
UHC 筌㏂끋乙멩걖純껉킅馭곣뫂吟랃㎖栒쎌궅筌욎ㅃB 11101111101001111010001011100011100001011011110111101011111000001011100011100110100000011000000111100010111011011000001111101010101101001001000111100101110111111000000111100010100100011010011011101011111000011000110111101111101001111010001011100010111000111011110111101100100000101001111011101111101001111001111011101100101001001011001101000010 efa7a2e385bdebe0b8e68181e2ed83eab491e5df81e291a6ebe18defa7a2e2e3bdec829eefa79eeca4b342

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
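
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, as is the use of `errors="replace"` to reproduce the `?` (0x3f) that appears when a character has no representation in the target charset. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical helpers introduced only for this illustration.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f),
    which is an assumption about how the page handles them.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hex bit string.
    for label, (bits, hexa) in convert("あB").items():
        print(label, bits, hexa)
```

For the single letter `B`, every charset in the table yields the same byte, `01000010` (hex `42`), which matches the final byte of each row above. Multi-byte characters differ per charset, and characters missing from a charset collapse to `3f`.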