What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????¨??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111101010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3fa83f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥?∥宜?????猥?¨有?????????? 1110010111110001001111111000000101100001100010110101100000111111001111110011111100111111001111111110000011001110001111111000000101001110100101110100110000111111001111110011111100111111001111110011111100111111001111110011111100111111 e5f13f81618b583f3f3f3f3fe0ce3f814e974c3f3f3f3f3f3f3f3f3f3f
EUC-JP 褥?‖宜?????猥?¨有????????靷? 11101010111100110011111110100001110000101011010110111001001111110011111100111111001111110011111111100000110100000011111110100001101011111100110110101101001111110011111100111111001111110011111100111111001111110011111110001111111001111011110100111111 eaf33fa1c2b5b93f3f3f3f3fe0d03fa1afcdad3f3f3f3f3f3f3f3f8fe7bd3f
UTF-8 褥띕∥宜방끽戮녹춷猥됰¨有멩뇻類ㅻ뮋嶺뚮뿫靷뾄 1110100010100100101001011110101110011101100101011110001010001000101001011110010110101110100111001110101110110000101010011110101110000001101111011110111110100111100100101110101110000101101110011110110010110110101101111110011110001100101001011110101110010000101100001100001010101000111001101001110010001001111010111010100110101001111010111000011110111011111011111010011110010000111000111000010110111011111010111010111010001011111011111010011010101011111010111001101010101110111010111011111110101011111010011001110110110111111010111011111010000100 e8a4a5eb9d95e288a5e5ae9cebb0a9eb81bdefa792eb85b9ecb6b7e78ca5eb90b0c2a8e69c89eba9a9eb87bbefa790e385bbebae8befa6abeb9aaeebbfabe99db7ebbe84
UHC 褥띕∥宜방끽戮녹춷猥됰¨有멩뇻類ㅻ뮋嶺뚮뿫靷뾄 11101001101100111011011011101011101000011010101111101011111100011011100111100110101100111010001111101011101111011011001111101100101011011001001111101000111001011000100111101011101000011010011111101010111100111011100011100110101101001010011111101011101110101010010011101011100100101001100111100111101011011000110011101011100101111010101111101100111001101001011101000010 e9b3b6eba1abebf1b9e6b3a3ebbdb3ecad93e8e589eba1a7eaf3b8e6b4a7ebbaa4eb9299e7ad8ceb97abece69742

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
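
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_all are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which resembles the 3f runs in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_all(text):
    """Return {label: (binary_string, hex_string)} for each charset.

    Unencodable characters are replaced with '?' (0x3f) instead of
    raising an error.
    """
    rows = {}
    for label, codec in CODECS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hexadecimal string.
    for label, (bits, hex_string) in encode_all("有").items():
        print(label, bits, hex_string)
```

Calling encode_all with a single character gives one binary and one hexadecimal string per charset, in the same column order as the table.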