To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???伊??柔?????筍??攸?????邯B 00111111001111110011111110001000110010010011111100111111100011110101111100111111001111110011111100111111001111111110001010100001001111110011111110011101101111110011111100111111001111110011111100111111111001111011011001000010 3f3f3f88c93f3f8f5f3f3f3f3f3fe2a13f3f9dbf3f3f3f3f3fe7b642
EUC-JP ???伊??柔?????筍??攸?????邯B 00111111001111110011111110110000110010110011111100111111101111011100000000111111001111110011111100111111001111111110010010100011001111110011111111011010110000010011111100111111001111110011111100111111111011101011100001000010 3f3f3fb0cb3f3fbdc03f3f3f3f3fe4a33f3fdac13f3f3f3f3feeb842
UTF-8 蓮곸궡伊볡쳥柔ㅽ뱺列뜯뫒筍됮넫攸됲렮略노맧邯B 11101111101001101001100111101010101100111011100011101010101101101010000111100100101111001000101011101011101100111010000111101100101100111010010111100110100111111001010011100011100001011011110111101011101100011011101011101111101001101001110011101011100111001010111111101011101010111001001011100111101011011000110111101011100100001010111011101011100001001010101111100110100101001011100011101011100100001011001011101011101000001010111011101111101001011011011011101011100001011011100011101011101001111010011111101001100000101010111101000010 efa699eab3b8eab6a1e4bc8aebb3a1ecb3a5e69f94e385bdebb1baefa69ceb9cafebab92e7ad8deb90aeeb84abe694b8eb90b2eba0aeefa5b6eb85b8eba7a7e982af42
UHC 蓮곸궡伊볡쳥柔ㅽ뱺列뜯뫒筍됮넫攸됲렮略노맧邯B 111001101110010110000001111011001000001010110100111011001010010110010011111001111010101110001010111010101111010110100100111011011001001110100000111001101110101010110110111000101001000110110100111000101110110010001001111010011000011010101011111010101111001010001001111011011000111010111011111001011011001010110011111010111001000010110000110010101111101101000010 e6e581ec82b4eca593e7ab8aeaf5a4ed93a0e6eab6e291b4e2ec89e986abeaf289ed8ebbe5b2b3eb90b0cafb42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)