To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?N?????????N????????^ 001111110100111000111111001111110011111100111111001111110011111100111111001111110011111101001110001111110011111100111111001111110011111100111111001111110011111101011110 3f4e3f3f3f3f3f3f3f3f3f4e3f3f3f3f3f3f3f3f5e
SJIS-WIN 短N属辿孫歎嘆俗損足短N属辿孫歎嘆俗損足^ 100100100101101001001110100100011010111010010010010010001001000110110111100100100101011010010010010100011001000110101101100100011011100110010001101010111001001001011010010011101001000110101110100100100100100010010001101101111001001001010110100100100101000110010001101011011001000110111001100100011010101101011110 925a4e91ae924891b79256925191ad91b991ab925a4e91ae924891b79256925191ad91b991ab5e
EUC-JP 短N属辿孫歎嘆俗損足短N属辿孫歎嘆俗損足^ 110000111011101101001110110000101011000011000011101010011100001010111001110000111011011111000011101100101100001010101111110000101011101111000010101011011100001110111011010011101100001010110000110000111010100111000010101110011100001110110111110000111011001011000010101011111100001010111011110000101010110101011110 c3bb4ec2b0c3a9c2b9c3b7c3b2c2afc2bbc2adc3bb4ec2b0c3a9c2b9c3b7c3b2c2afc2bbc2ad5e
UTF-8 短N属辿孫歎嘆俗損足短N属辿孫歎嘆俗損足^ 111001111001111110101101010011101110010110110001100111101110100010111110101111111110010110101101101010111110011010101101100011101110010110011000100001101110010010111111100101111110011010010000100011011110100010110110101100111110011110011111101011010100111011100101101100011001111011101000101111101011111111100101101011011010101111100110101011011000111011100101100110001000011011100100101111111001011111100110100100001000110111101000101101101011001101011110 e79fad4ee5b19ee8bebfe5adabe6ad8ee59886e4bf97e6908de8b6b3e79fad4ee5b19ee8bebfe5adabe6ad8ee59886e4bf97e6908de8b6b35e
UHC 短N??孫歎嘆俗損足短N??孫歎嘆俗損足^ 1101001110101101010011100011111100111111111000011101110111110111101001111111011110100011111000011101010011100001110111111111000011101011110100111010110101001110001111110011111111100001110111011111011110100111111101111010001111100001110101001110000111011111111100001110101101011110 d3ad4e3f3fe1ddf7a7f7a3e1d4e1dff0ebd3ad4e3f3fe1ddf7a7f7a3e1d4e1dff0eb5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)