To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺??外?????節??邀??鈺??外?? 111110111100010000111111001111111000101001001111001111110011111100111111001111110011111110010000110111110011111100111111111001111011000100111111001111111111101111000100001111110011111110001010010011110011111100111111 fbc43f3f8a4f3f3f3f3f3f90df3f3fe7b13f3ffbc43f3f8a4f3f3f
EUC-JP 鈺??外??縕??節??邀??鈺??外?? 10001111111000111101010100111111001111111011001110110000001111110011111110001111110101001100001000111111001111111100000011100001001111110011111111101110101100110011111100111111100011111110001111010101001111110011111110110011101100000011111100111111 8fe3d53f3fb3b03f3f8fd4c23f3fc0e13f3feeb33f3f8fe3d53f3fb3b03f3f
UTF-8 鈺롳슐外뺧쉈縕붻뙣節쏙쉽邀섓쉥鈺롳슐外뺧쉭 111010011000100010111010111010111010000110110011111011001000101010010000111001011010010010010110111010111011101010100111111011001000100110001000111001111011100010010101111010111011011010111011111010111001100110100011111001111010111110000000111011001000111110011001111011001000100110111101111010011000001010000000111011001000010010010011111011001000100110100101111010011000100010111010111010111010000110110011111011001000101010010000111001011010010010010110111010111011101010100111111011001000100110101101 e988baeba1b3ec8a90e5a496ebbaa7ec8988e7b895ebb6bbeb99a3e7af80ec8f99ec89bde98280ec8493ec89a5e988baeba1b3ec8a90e5a496ebbaa7ec89ad
UHC 鈺롳슐外뺧쉈縕붻뙣節쏙쉽邀섓쉥鈺롳슐外뺧쉭 111010001010110110001110111011111011110110110110111010001110001010010101111011111011110110100101111010001011001010010100111010001000110010101000111011111011110110111101111011111011110110110001111010011010110110011000111011111011110110101011111010001010110110001110111011111011110110110110111010001110001010010101111011111011110110101101 e8ad8eefbdb6e8e295efbda5e8b294e88ca8efbdbdefbdb1e9ad98efbdabe8ad8eefbdb6e8e295efbdad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)