To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鞨゙邇シセ謗゙゙邇シズ鞨゙邇シセ謗゙゙邇シスワ^ 111010001110000011011110111001111000111010111100101111101110011010001110110111101101111011100111100011101011110010111101110111101110100011100000110111101110011110001110101111001011111011100110100011101101111011011110111001111000111010111100101111011101110001011110 e8e0dee78ebcbee68ededee78ebcbddee8e0dee78ebcbee68ededee78ebcbddc5e
EUC-JP 鞨゙邇シセ謗゙゙邇シズ鞨゙邇シセ謗゙゙邇シスワ^ 11110000111000101000111011011110111011011110111010001110101111001000111010111110111010111110111010001110110111101000111011011110111011011110111010001110101111001000111010111101100011101101111011110000111000101000111011011110111011011110111010001110101111001000111010111110111010111110111010001110110111101000111011011110111011011110111010001110101111001000111010111101100011101101110001011110 f0e28edeedee8ebc8ebeebee8ede8edeedee8ebc8ebd8edef0e28edeedee8ebc8ebeebee8ede8edeedee8ebc8ebd8edc5e
UTF-8 鞨゙邇シセ謗゙゙邇シズ鞨゙邇シセ謗゙゙邇シスワ^ 11101001100111101010100011101111101111101001111011101001100000101000011111101111101111011011110011101111101111011011111011101000101011001001011111101111101111101001111011101111101111101001111011101001100000101000011111101111101111011011110011101111101111011011110111101111101111101001111011101001100111101010100011101111101111101001111011101001100000101000011111101111101111011011110011101111101111011011111011101000101011001001011111101111101111101001111011101111101111101001111011101001100000101000011111101111101111011011110011101111101111011011110111101111101111101001110001011110 e99ea8efbe9ee98287efbdbcefbdbee8ac97efbe9eefbe9ee98287efbdbcefbdbdefbe9ee99ea8efbe9ee98287efbdbcefbdbee8ac97efbe9eefbe9ee98287efbdbcefbdbdefbe9c5e
UHC 鞨?邇??謗??邇???鞨?邇??謗??邇???^ 110010101110101000111111111011001100010000111111001111111101101110111111001111110011111111101100110001000011111100111111001111111100101011101010001111111110110011000100001111110011111111011011101111110011111100111111111011001100010000111111001111110011111101011110 caea3fecc43f3fdbbf3f3fecc43f3f3fcaea3fecc43f3fdbbf3f3fecc43f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)