To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???靭??音??筌??維??幽??雅?? 001111110011111100111111100100000111100000111111001111111000100110111001001111110011111111100010101000110011111100111111100010001101101100111111001111111001011101001000001111110011111110001001111010110011111100111111 3f3f3f90783f3f89b93f3fe2a33f3f88db3f3f97483f3f89eb3f3f
EUC-JP ???靭??音??筌??維??幽??雅?? 001111110011111100111111101111111101100100111111001111111011001010111011001111110011111111100100101001010011111100111111101100001101110100111111001111111100110110101001001111110011111110110010111011010011111100111111 3f3f3fbfd93f3fb2bb3f3fe4a53f3fb0dd3f3fcda93f3fb2ed3f3f
UTF-8 麗몃쓷靭뚦첎音섏굳筌딄퍗維믥솾幽됰폀雅뚯쉰 111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001101010100110111011001011001010001110111010011001111110110011111011001000010010001111111010101011010110110011111001111010110110001100111010111001010010000100111011011000110110010111111001111011011010101101111010111010111110100101111011001000011010111110111001011011100110111101111010111001000010110000111011011000111110000000111010011001101110000101111010111001101010101111111011001000100110110000 efa688ebaa83ec93b7e99dadeb9aa6ecb28ee99fb3ec848feab5b3e7ad8ceb9484ed8d97e7b6adebafa5ec86bee5b9bdeb90b0ed8f80e99b85eb9aafec89b0
UHC 麗몃쓷靭뚦첎音섏굳筌딄퍗維믥솾幽됰폀雅뚯쉰 111001101011000010111000111010111001110110010100111011001110010110001100111001011010101010011011111010111110010110011000111011001011000110111011111011111010011110001010111010101011101110001110111010111010101110010010111001111001100110110010111010101110101110001001111010111011110010001111111001001011101010001100111011001011110110101110 e6b0b8eb9d94ece58ce5aa9bebe598ecb1bbefa78aeabb8eebab92e799b2eaeb89ebbc8fe4ba8cecbdae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)