To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8諭??議?????壹?┃怨λ?? 111000011001111100111111100000100101011110010111010000000011111100111111100010110110001100111111001111110011111100111111001111111001101011100011001111111000010010101011100010011000010110000011110010010011111100111111 e19f3f825797403f3f8b633f3f3f3f3f9ae33f84ab898583c93f3f
EUC-JP 癲?8諭??議?????壹?┃怨λ?孼 1110001010100001001111111010001110111000110011011010000100111111001111111011010111000100001111110011111100111111001111110011111111010100111001010011111110101000101011011011000111100101101001101100101100111111100011111011101011000011 e2a13fa3b8cda13f3fb5c43f3f3f3f3fd4e53fa8adb1e5a6cb3f8fbac3
UTF-8 癲쒕8諭룡쾮議얜옙亮쎈뜉壹삯┃怨λ젦孼 1110011110011001101100101110110010010010100101011110111110111100100110001110100010101011101011011110101110100011101000011110110010111110101011101110100010101101101100001110110010010110100111001110110010011000100110011110111110100101101101111110110010001110100010001110101110011100100010011110010110100011101110011110110010000010101011111110001010010100100000111110011010000000101010001100111010111011111011001010000010100110111001011010110110111100 e799b2ec9295efbc98e8abadeba3a1ecbeaee8adb0ec969cec9899efa5b7ec8e88eb9c89e5a3b9ec82afe29483e680a8cebbeca0a6e5adbc
UHC 癲쒕8諭룡쾮議얜옙亮쎈뜉壹삯┃怨λ젦孼 1110111110100110100111001110101110100011101110001110101110110001101101111110011010110010100001011110110010100001101111101110101110111111101111011110010110111001101111011110101110001101100011001110110011101100101110111110100110100110101011011110101010110011101001011110101110100000100111101110010111101101 efa69ceba3b8ebb1b7e6b285eca1beebbfbde5b9bdeb8d8cececbbe9a6adeab3a5eba09ee5ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)