To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??揖?┥轅??永??泣??音??? 1001011011101001001111110011111110010111010010110011111110000100101111001110011101110110001111110011111110001001011010010011111100111111100010111000001100111111001111111000100110111001001111110011111100111111 96e93f3f974b3f84bce7763f3f89693f3f8b833f3f89b93f3f3f
EUC-JP 夜??揖?┥轅??永??泣??音??旿 11001100111010110011111100111111110011011010110000111111101010001011111011101101110101110011111100111111101100011100101000111111001111111011010111100011001111110011111110110010101110110011111100111111100011111100000111110100 cceb3f3fcdac3fa8beedd73f3fb1ca3f3fb5e33f3fb2bb3f3f8fc1f4
UTF-8 夜껊씛揖좑┥轅고겲永띠옚泣덃릸音쎌돺旿 111001011010010010011100111010101011101110001010111011001001010010011011111001101000111110010110111011001010001010010001111000101001010010100101111010001011110110000101111010101011001110100000111010101011001010110010111001101011000010111000111010111001110110100000111011001001100010011010111001101011001110100011111010111000110110000011111010111010011010111000111010011001111110110011111011001000111010001100111010111000111110111010111001101001011110111111 e5a49ceabb8aec949be68f96eca291e294a5e8bd85eab3a0eab2b2e6b0b8eb9da0ec989ae6b3a3eb8d83eba6b8e99fb3ec8e8ceb8fbae697bf
UHC 夜껊씛揖좑┥轅고겲永띠옚泣덃릸音쎌돺旿 1110010110101000100000111110101110011101101100001110101111100111101000001110111110100110101111101110101010111111101100001110110110000001101111101110011110110101101101101110110010011110100111101110101111101000100010001110011010010000100101101110101111100101101111011110110010001001101111011110011111111010 e5a883eb9db0ebe7a0efa6beeabfb0ed81bee7b5b6ec9e9eebe888e69096ebe5bdec89bde7fa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)