To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?踰??邑??碍?????猿????? 111001001110100010000010111010100011111111100110111110100011111100111111100101110101011100111111001111111000101001010110001111110011111100111111001111110011111110001001100011100011111100111111001111110011111100111111 e4e882ea3fe6fa3f3f97573f3f8a563f3f3f3f3f898e3f3f3f3f3f
EUC-JP 蒻れ?踰??邑??碍?????猿????? 111010001110101010100100111011000011111111101100111111000011111100111111110011011011100000111111001111111011001110110111001111110011111100111111001111110011111110110001111011100011111100111111001111110011111100111111 e8eaa4ec3fecfc3f3fcdb83f3fb3b73f3f3f3f3fb1ee3f3f3f3f3f
UTF-8 蒻れ슜踰⑼쭗邑룹쵅碍⑸낌留볢뀑猿딅쎗歷몄쭫 111010001001001010111011111000111000001010001100111011001000101010011100111010001011100010110000111000101001000110111100111011001010110110010111111010011000001010010001111010111010001110111001111011001011010110000101111001111010001010001101111000101001000110111000111010111000001010001100111011111010011110001101111010111011001110100010111010111000000010010001111001111000110010111111111010111001010010000101111011001000111010010111111011111010011010001100111010111010101010000100111011001010110110101011 e892bbe3828cec8a9ce8b8b0e291bcecad97e98291eba3b9ecb585e7a28de291b8eb828cefa78debb3a2eb8091e78cbfeb9485ec8e97efa68cebaa84ecadab
UHC 蒻れ슜踰⑼쭗邑룹쵅碍⑸낌留볢뀑猿딅쎗歷몄쭫 111001011011011010101010111011001001101010101001111010111011001010101001111011111010011110001111111010111110100110110111111011001010110010000111111001001111010010101001111010111011001110100110111010111010011110010011111010001000010110001011111010101011101110001010111010111001101110111110111001101011100010111000111011001010011110011111 e5b6aaec9aa9ebb2a9efa78febe9b7ecac87e4f4a9ebb3a6eba793e8858beabb8aeb9bbee6b8b8eca79f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)