To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 濡??烏???????????濡??濡??B 1001010001000111001111110011111110001001010001110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100101000100011100111111001111111001010001000111001111110011111101000010 94473f3f89473f3f3f3f3f3f3f3f3f3f3f94473f3f94473f3f42
EUC-JP 濡??烏???????????濡??濡??B 1100011110101000001111110011111110110001101010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111110001111010100000111111001111111100011110101000001111110011111101000010 c7a83f3fb1a83f3f3f3f3f3f3f3f3f3f3fc7a83f3fc7a83f3f42
UTF-8 濡띾졂烏숇죲溜노죲溜싲죲溜녘줂濡띾젡濡띾젇B 11100110101111111010000111101011100111011011111011101100101000011000001011100111100000111000111111101100100010001000011111101100101000111011001011101111101001111000101111101011100001011011100011101100101000111011001011101111101001111000101111101100100010111011001011101100101000111011001011101111101001111000101111101011100001011001100011101100101001001000001011100110101111111010000111101011100111011011111011101100101000001010000111100110101111111010000111101011100111011011111011101100101000001000011101000010 e6bfa1eb9dbeeca182e7838fec8887eca3b2efa78beb85b8eca3b2efa78bec8bb2eca3b2efa78beb8598eca482e6bfa1eb9dbeeca0a1e6bfa1eb9dbeeca08742
UHC 濡띾졂烏숇죲溜노죲溜싲죲溜녘줂濡띾젡濡띾젇B 11101011101000011000110111101011101000001011001111101000101000011001100111101011101000011000110111101010111111101011001111101011101000011000110111101010111111101001101011101011101000011000110111101010111111101011001111101000101000011001100111101011101000011000110111101011101000001001101011101011101000011000110111101011101000001000101001000010 eba18deba0b3e8a199eba18deafeb3eba18deafe9aeba18deafeb3e8a199eba18deba09aeba18deba08a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)