To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??椅??濡μ?巍ル?悠?????筌?? 1110000110011111001111110011111110001000110101100011111100111111100101000100011110000011110010100011111110011011110110011000001110001011001111111001011101001001001111110011111100111111001111110011111111100010101000110011111100111111 e19f3f3f88d63f3f944783ca3f9bd9838b3f97493f3f3f3f3fe2a33f3f
EUC-JP 癲??椅??濡μ?巍ル?悠??洧??筌?? 11100010101000010011111100111111101100001101100000111111001111111100011110101000101001101100110000111111110101101101101110100101111010110011111111001101101010100011111100111111100011111100011110110100001111110011111111100100101001010011111100111111 e2a13f3fb0d83f3fc7a8a6cc3fd6dba5eb3fcdaa3f3f8fc7b43f3fe4a53f3f
UTF-8 癲ㅻ슡椅썲뵱濡μ돺巍ル쵐悠썸갭洧붿몚筌앹퉹 1110011110011001101100101110001110000101101110111110110010001010101000011110011010100100100001011110110010001101101100101110101110110101101100011110011010111111101000011100111010111100111010111000111110111010111001011011011110001101111000111000001110101011111011001011010110010000111001101000001010100000111011001000110110111000111010101011000010101101111001101011010010100111111010111011011010111111111010111010101010011010111001111010110110001100111011001001010110111001111011011000100110111001 e799b2e385bbec8aa1e6a485ec8db2ebb5b1e6bfa1cebceb8fbae5b78de383abecb590e682a0ec8db8eab0ade6b4a7ebb6bfebaa9ae7ad8cec95b9ed89b9
UHC 癲ㅻ슡椅썲뵱濡μ돺巍ル쵐悠썸갭洧붿몚筌앹퉹 111011111010011010100100111010111001101010101101111010111111010110111101111001011001010010101111111010111010000110100101111011001000100110111101111010001110010010101011111010111010110010010010111010101110110110111101111001101011000010111000111010101111101110010100111011001001000110001000111011111010011110011101111011001011100110010001 efa6a4eb9aadebf5bde594afeba1a5ec89bde8e4abebac92eaedbde6b0b8eafb94ec9188efa79decb991

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
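
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. The sketch also assumes that a character with no representation in the target charset is replaced by "?" (hex 3f), which appears to be what the table above shows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters that the codec cannot represent are replaced by "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexa) in encode_table("あ").items():
        print(name, bits, hexa)
```

Using `errors="replace"` rather than the default strict mode keeps the conversion from raising an exception on unsupported characters, at the cost of losing information about which characters were replaced.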