To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?踰??幽??阿??猷??儀??魚?? 1110000110011111100000111000101100111111111001101111101000111111001111111001011101001000001111110011111110001000101000100011111100111111100101110101000100111111001111111000101101010110001111110011111110001011100110110011111100111111 e19f838b3fe6fa3f3f97483f3f88a23f3f97513f3f8b563f3f8b9b3f3f
EUC-JP 癲ル?踰??幽??阿??猷??儀??魚?? 1110001010100001101001011110101100111111111011001111110000111111001111111100110110101001001111110011111110110000101001000011111100111111110011011011001000111111001111111011010110110111001111110011111110110101111110110011111100111111 e2a1a5eb3fecfc3f3fcda93f3fb0a43f3fcdb23f3fb5b73f3fb5fb3f3f
UTF-8 癲ル슢踰됵쭫幽껊짎阿숈뇠猷뗰쬉儀볥옩魚좏뇫 111001111001100110110010111000111000001110101011111011001000101010100010111010001011100010110000111010111001000010110101111011001010110110101011111001011011100110111101111010101011101110001010111011001010011110001110111010011001100010111111111011001000100010001000111010111000011110100000111001111000110010110111111010111001011110110000111011001010110010001001111001011000010010000000111010111011001110100101111011001001100010101001111010011010110110011010111011001010001010001111111010111000011110101011 e799b2e383abec8aa2e8b8b0eb90b5ecadabe5b9bdeabb8aeca78ee998bfec8888eb87a0e78cb7eb97b0ecac89e58480ebb3a5ec98a9e9ad9aeca28feb87ab
UHC 癲ル슢踰됵쭫幽껊짎阿숈뇠猷뗰쬉儀볥옩魚좏뇫 111011111010011010101011111010111001101010101110111010111011001010001001111011111010011110011111111010101110101110000011111010111010001110011010111001001011100110011001111011001000011110001000111010111010001110001011111011111010011010011111111010111111000010010011111010111001111010101000111001011110000010100000111011011000011110010001 efa6abeb9aaeebb289efa79feaeb83eba39ae4b999ec8788eba38befa69febf093eb9ea8e5e0a0ed8791

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)