To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 娃??鈺??搖?ⅱ節わ?要??節??橈??獄?? 10001000101000010011111100111111111110111100010000111111001111111001110110001010001111111111101001000001100100001101111110000010111011010011111110010111011101100011111100111111100100001101111100111111001111111001111011110100001111110011111110001101100101100011111100111111 88a13f3ffbc43f3f9d8a3ffa4190df82ed3f97763f3f90df3f3f9ef43f3f8d963f3f
EUC-JP 娃??鈺??搖??節わ?要??節??橈??獄?? 10110000101000110011111100111111100011111110001111010101001111110011111111011001111010100011111100111111110000001110000110100100111011110011111111001101110101110011111100111111110000001110000100111111001111111101110011110110001111110011111110111001111101100011111100111111 b0a33f3f8fe3d53f3fd9ea3f3fc0e1a4ef3fcdd73f3fc0e13f3fdcf63f3fb9f63f3f
UTF-8 娃띰쉠鈺뚳쉐搖얏ⅱ節わ쉼要뺞뇠節몌쉬橈롳슴獄깍쉴 111001011010100010000011111010111001110110110000111011001000100110100000111010011000100010111010111010111001101010110011111011001000100110010000111001101001000010010110111011001001011010001111111000101000010110110001111001111010111110000000111000111000001010001111111011001000100110111100111010001010011010000001111010111011101010011110111010111000011110100000111001111010111110000000111010111010101010001100111011001000100110101100111001101010100110001000111010111010000110110011111011001000101010110100111001111000110110000100111010101011100110001101111011001000100110110100 e5a883eb9db0ec89a0e988baeb9ab3ec8990e69096ec968fe285b1e7af80e3828fec89bce8a681ebba9eeb87a0e7af80ebaa8cec89ace6a988eba1b3ec8ab4e78d84eab98dec89b4
UHC 娃띰쉠鈺뚳쉐搖얏ⅱ節わ쉼要뺞뇠節몌쉬橈롳슴獄깍쉴 111010001101111110110110111011111011110110101010111010001010110110001100111011111011110110100110111010001111010010111110111001101010010110100010111011111011110110101010111011111011110110110000111010011010100110010101111001101000011110001000111011111011110110111000111011111011110110101100111010001111101010001110111011111011110110111111111010001010101110110001111011111011110110101111 e8dfb6efbdaae8ad8cefbda6e8f4bee6a5a2efbdaaefbdb0e9a995e68788efbdb8efbdace8fa8eefbdbfe8abb1efbdaf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)