To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??^nf??^n^}Y??^nf??^n^}bE 00111111001111110101111001101110011001100011111100111111010111100110111001011110011111010101100100111111001111110101111001101110011001100011111100111111010111100110111001011110011111010110001001000101 3f3f5e6e663f3f5e6e5e7d593f3f5e6e663f3f5e6e5e7d6245
SJIS-WIN 嘆俗^nf嘆俗^n^}Y嘆俗^nf嘆俗^n^}bE 100100100101000110010001101011010101111001101110011001101001001001010001100100011010110101011110011011100101111001111101010110011001001001010001100100011010110101011110011011100110011010010010010100011001000110101101010111100110111001011110011111010110001001000101 925191ad5e6e66925191ad5e6e5e7d59925191ad5e6e66925191ad5e6e5e7d6245
EUC-JP 嘆俗^nf嘆俗^n^}Y嘆俗^nf嘆俗^n^}bE 110000111011001011000010101011110101111001101110011001101100001110110010110000101010111101011110011011100101111001111101010110011100001110110010110000101010111101011110011011100110011011000011101100101100001010101111010111100110111001011110011111010110001001000101 c3b2c2af5e6e66c3b2c2af5e6e5e7d59c3b2c2af5e6e66c3b2c2af5e6e5e7d6245
UTF-8 嘆俗^nf嘆俗^n^}Y嘆俗^nf嘆俗^n^}bE 1110010110011000100001101110010010111111100101110101111001101110011001101110010110011000100001101110010010111111100101110101111001101110010111100111110101011001111001011001100010000110111001001011111110010111010111100110111001100110111001011001100010000110111001001011111110010111010111100110111001011110011111010110001001000101 e59886e4bf975e6e66e59886e4bf975e6e5e7d59e59886e4bf975e6e66e59886e4bf975e6e5e7d6245
UHC 嘆俗^nf嘆俗^n^}Y嘆俗^nf嘆俗^n^}bE 111101111010001111100001110101000101111001101110011001101111011110100011111000011101010001011110011011100101111001111101010110011111011110100011111000011101010001011110011011100110011011110111101000111110000111010100010111100110111001011110011111010110001001000101 f7a3e1d45e6e66f7a3e1d45e6e5e7d59f7a3e1d45e6e66f7a3e1d45e6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)