To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??裕??癒る?域??愉??惟??曖 100010010101000100111111001111111001011101010100001111110011111110010110111111001000001011101001001111111000100011100110001111110011111110010110111110010011111100111111100010001101001000111111001111111001111001000010 89513f3f97543f3f96fc82e93f88e63f3f96f93f3f88d23f3f9e42
EUC-JP 渦??裕??癒る?域??愉??惟??曖 101100011011001000111111001111111100110110110101001111110011111111001100111111101010010011101011001111111011000011101000001111110011111111001100111110110011111100111111101100001101010000111111001111111101101110100011 b1b23f3fcdb53f3fccfea4eb3fb0e83f3fccfb3f3fb0d43f3fdba3
UTF-8 渦기뫁裕뗦끽癒る쎗域㏐퀣愉잏솾惟겹돖曖 111001101011100010100110111010101011100010110000111010111010101110000001111010001010001110010101111010111001011110100110111010111000000110111101111001111001100110010010111000111000001010001011111011001000111010010111111001011001111110011111111000111000111110010000111011011000000010100011111001101000010010001001111011001001111010001111111011001000011010111110111001101000001110011111111010101011001010111001111010111000111110010110111001101001101110010110 e6b8a6eab8b0ebab81e8a395eb97a6eb81bde79992e3828bec8e97e59f9fe38f90ed80a3e68489ec9e8fec86bee6839feab2b9eb8f96e69b96
UHC 渦기뫁裕뗦끽癒る쎗域㏐퀣愉잏솾惟겹돖曖 1110100010111110101100011110001010010001101001011110101110101110100010111110011010110011101000111110101110101000101010101110101110011011101111101110011010110100101001111110101010110011100101111110101011110000100111111110011110011001101100101110101011101110101100001110001110001001101000001110010011110010 e8beb1e291a5ebae8be6b3a3eba8aaeb9bbee6b4a7eab397eaf09fe799b2eaeeb0e389a0e4f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)