To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ????????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
EUC-JP ????????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
UTF-8 셔롘렽롊렽샵셔렟렽섧셍섬셔롘렽선셔섧셍섧셔섦셔섹h 11101100100001011001010011101011101000011001100011101011101000001011110111101011101000011000101011101011101000001011110111101100100000111011010111101100100001011001010011101011101000001001111111101011101000001011110111101100100001001010011111101100100001011000110111101100100001001010110011101100100001011001010011101011101000011001100011101011101000001011110111101100100001001010000011101100100001011001010011101100100001001010011111101100100001011000110111101100100001001010011111101100100001011001010011101100100001001010011011101100100001011001010011101100100001001011100101101000 ec8594eba198eba0bdeba18aeba0bdec83b5ec8594eba09feba0bdec84a7ec858dec84acec8594eba198eba0bdec84a0ec8594ec84a7ec858dec84a7ec8594ec84a6ec8594ec84b968
UHC 셔롘렽롊렽샵셔렟렽섧셍섬셔롘렽선셔섧셍섧셔섦셔섹h 10111100110001011000111011011100100011101100010110001110110100001000111011000101101111001010010110111100110001011000111010110000100011101100010110111100101101011011110011000100101111001011011010111100110001011000111011011100100011101100010110111100101100011011110011000101101111001011010110111100110001001011110010110101101111001100010110111100101101001011110011000101101111001011110101101000 bcc58edc8ec58ed08ec5bca5bcc58eb08ec5bcb5bcc4bcb6bcc58edc8ec5bcb1bcc5bcb5bcc4bcb5bcc5bcb4bcc5bcbd68

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
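
The conversion in the table above can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, euc_jp for EUC-JP) and the function name encode_table are assumptions made for illustration. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the rows of 3f bytes shown for ISO-8859-1, SJIS-WIN and EUC-JP.

```python
# Charset label as shown in the table -> Python codec name.
# These pairings are assumptions for this sketch, not the tool's own configuration.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("h"):
        print(label, bits, hex_string)
```

Called with "h", every row comes out as 01101000 and 68, because all five charsets encode ASCII letters identically. The differences appear only for non-ASCII input.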