What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???猷??喩??押る?裕?????腰?? 001111110011111100111111100101110101000100111111001111111001101001100111001111110011111110001001100111111000001011101001001111111001011101010100001111110011111100111111001111110011111110001101100110000011111100111111 3f3f3f97513f3f9a673f3f899f82e93f97543f3f3f3f3f8d983f3f
EUC-JP ???猷??喩??押る?裕??洧??腰?? 0011111100111111001111111100110110110010001111110011111111010011110010000011111100111111101100101010000110100100111010110011111111001101101101010011111100111111100011111100011110110100001111110011111110111001111110000011111100111111 3f3f3fcdb23f3fd3c83f3fb2a1a4eb3fcdb53f3f8fc7b43f3fb9f83f3f
UTF-8 麗몃씈猷딃렟喩쎻봼押る굟裕딀갭洧뺛뀎腰뱀뙌 111011111010011010001000111010111010101010000011111011001001010010001000111001111000110010110111111010111001010010000011111010111010000010011111111001011001011010101001111011001000111010111011111010111011010010111100111001101000101010111100111000111000001010001011111010101011010110011111111010001010001110010101111010111001010010000000111010101011000010101101111001101011010010100111111010111011101010011011111010111000000010001110111010001000010110110000111010111011000110000000111010111001100110001100 efa688ebaa83ec9488e78cb7eb9483eba09fe596a9ec8ebbebb4bce68abce3828beab59fe8a395eb9480eab0ade6b4a7ebba9beb808ee885b0ebb180eb998c
UHC 麗몃씈猷딃렟喩쎻봼押る굟裕딀갭洧뺛뀎腰뱀뙌 111001101011000010111000111010111001110110100000111010111010001110001010111010011000111010110000111010101110011110011011111000101001010010000011111001001110001110101010111010111000001010000111111010111010111010001010111001101011000010111000111010101111101110010101111000111000010110001001111010011010011010111001111011001000110010010001 e6b0b8eb9da0eba38ae98eb0eae79be29483e4e3aaeb8287ebae8ae6b0b8eafb95e38589e9a6b9ec8c91

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
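
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `to_bits` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to be what the table rows above do.

```python
from typing import Dict, Tuple

# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated without spaces."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_table(text: str) -> Dict[str, Tuple[str, str]]:
    """Return {charset label: (binary string, hex string)} for the given text."""
    table = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes '?' for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        table[label] = (to_bits(data), data.hex())
    return table


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("る").items():
        print(label, bits, hex_string)
```

For the single character る, this sketch gives e3828b in UTF-8, 82e9 in SJIS-WIN, and a4eb in EUC-JP, the same byte sequences that appear for る in the rows above, while ISO-8859-1 falls back to 3f.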